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Ultra- fast stochastic cooling (i.e., on microsecond time-scales) would be desirable in certain appli- 
cations, for example, in order to boost final luminosity in a muon collider or neutrino factory, where 
even with relativistic dilation, the short particle lifetimes severely limit the total time available to 
reduce beam phase space. But fast cooling requires very high-bandwidth amplifiers so as to limit the 
incoherent heating effects from neighboring particles. A method of transit-time optical stochastic 
cooling has been proposed which would employ high-gain, high-bandwidth, solid-state lasers to am- 
plify the spontaneous radiation from the charged particle bunch in a strong-field magnetic wiggler. 
This amplified light is then fed back onto the same bunch inside a second wiggler, with appropriate 
phase delay to effect cooling. But before amplification, the usable signal from any one particle 
is quite small, on average much less than one photon for each pass through the wiggler, suggest- 
ing that the radiation should be treated quantum mechanically, and raising doubts as to whether 
this weak signal even contains sufficient phase information necessary for cooling, and whether it 
can be reliably amplified to provide the expected cooling on each pass. A careful examination of 
the dynamics, where the radiation and amplification processes are treated quantum mechanically, 
indicates that fast cooling is in principle possible, with cooling rates which essentially agree with 
classical calculations, provided that the effects of the unavoidable amplifier noise arising from quan- 
tum mechanical uncertainty are included. Thus, in this context quantum mechanical uncertainties 
do not present any insurmountable obstacles to optical cooling, nor do they lead to any significant 
differences than in the purely classical regime, but do establish a lower limit on cooling rates and 
achievable emittances. 



A. A violent order is disorder; and 

B. A great disorder is order. These 
Two things are one. 

Wallace Stevens 
"Connoisseur of Chaos" 

In all chaos there is a cosmos, in all disorder a secret order. 

Carl Jung 

No phenomenon is a phenomenon until it is an observed phenomenon. 

John Archibald Wheeler 



I. INTRODUCTION AND OVERVIEW 



Developed by Simon van der Meer and collaborators (see [TJ|5J[3] for reviews), conventional stochastic cooling using 
radio- frequency (RF) signals has achieved great success in increasing phase space density of particle bunches in storage 
rings, for heavier particles such as protons or anti-protons where synchrotron radiation damping is inefficient. Cooling 
time-scales typically range from minutes to hours. Ultra-fast stochastic cooling (i.e., on microsecond time-scales) 
would be desirable in certain applications^ , for example, to boost final luminosity in the proposed muon collider, 
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FIG. 1: Schematic of an electromagnetically-based stochastic cooling scheme. A signal containing information about particle 
phase space deviations from the reference orbit are produced in the kicker, possibly filtered, then amplified, and fed back onto 
the beam in the pickup, possibly after suitable beam-optical preparations induced on the latter. If necessary, a mixer section 
is included between cooling passes to ensure that the incoherent heating effects contribute diffusively. 



where the short particle lifetime severely limits the time available to reduce beam phase space. But fast cooling 
requires very high-bandwidth amplifiers so as to limit the incoherent heating effects from neighboring particles. A 
method of transit-time optical stochastic cooling (OSC) has been proposed E] which would employ high-gain, high- 
bandwidth, solid-state lasers to amplify the spontaneous radiation emitted from the charged particle bunch in a 
strong-field magnetic wiggler. This amplified light is then fed back onto the same bunch inside a second wiggler, with 
appropriate phase delay to effect cooling. But before amplification, the usable signal from any one particle is quite 
small, on average much less than one photon for each pass through the wiggler, suggesting that the radiation must 
be treated quantum mechanically, and raising doubts as to whether this weak signal even contains sufficient phase 
information required for cooling, and whether it can be reliably amplified to provide the needed cooling on each pass. A 
careful treatment of the dynamics, where the radiation and amplification processes are treated quantum mechanically, 
indicates that fast cooling is in principle possible, with cooling rates that essentially agree with a simple classical 
calculation, provided that the effects of the unavoidable amplifier noise arising from quantum mechanical uncertainty 
are included. Thus, quantum mechanical uncertainties do not appear to present any insurmountable obstacles to 
optical cooling, but do establish a lower limit on cooling rates and achievable emittances, so the effectiveness of such 
schemes will probably be limited by more prosaic classical concerns over the required laser power and phase control. 
These sources of noise, whether classical or quantum mechanical in origin, might be expected to act, more or less, 
much like some effective number of extra classical particles in the sample, affecting the cooling rate and equilibrium 
omittances in degree but not in kind, although they would nevertheless impose a lower bound on the achievable noise 
floor and ultimately limit the marginal effectiveness of efforts to dilute the beam density before cooling. 



II. STOCHASTIC COOLING: GENERAL FEATURES AND CONSIDERATIONS 



In stochastic cooling [U 12 E] , an interaction of a charged particle beam with a "pickup" measuring device generates 
a weak electromagnetic cooling signal containing partial information about the phase-space structure of the particles 
bunch, ideally in a non-perturbative limit, without any appreciable distortion of or back-action on the beam itself. A 
general schematic is shown in Fig. [T] If, while the beam is properly diverted and prepared, this signal is then suitably 
amplified, manipulated, and then fed back and made to interact with the same beam, in some "kicker" section or 
device, then the overall dynamics of the reduced (QD) beam phase space are non-Liouvillian, and both the longitudinal 
and transverse beam emittance can be reduced. Stochastic cooling is non-evaporative, in the sense that it does not 
decrease available phase space at the expense of removing the outlying particles; beam brightness can be increased as 
emittance is reduced. 



In essentially all stochastic cooling schemes so far proposed or implemented, in which the perturbations to certain 
particle phase space coordinates are approximately linearly proportional to some type of electromagnetic kicker fields, 
the damping time-scale r c for cooling of a particular degree of freedom, or equivalently the cooling rate t c _1 , at a 
particular time in the cooling process, can be at least roughly approximated by: 
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where f c is the frequency of passages through the cooling sections; G(t) is the net power gain of the amplifier acting 
on the pick-up signal; Bit) is a dimensionless positive parameter, depending on both the nature of the cooling scheme 
and on the current particle phase space distribution, and which represents the cooling effects arising from interaction 
with the feedback signal in the kicker; and 

A(t) = A + A 1 (N S + N n ) + ---, (2) 

represents the heating effects due to each particle's interaction with its own and neighboring particles' signals, where 
A Q arises from self-fields and A\ and higher-order terms are due to fields from neighboring particles; N n is a measure of 
extra noise introduced in the kicker signal by the amplifier, expressed as an equivalent number of extra particles; and 
N a is known as the effective sample size, or number of samples, and represents the effective number of particles with 
whose kicker signal a given particle also interacts in addition to its own amplified self-field. Because the pick-up and 
amplifier system has finite bandwidth and therefore finite time-response, the signals of some number of neighboring 
particles always corrupt a given particle's signal, decreasing the effectiveness of cooling. In fact, the first term of 
is the drift, or coherent term, and arises solely from the interaction of each particle with its own kicker signal, which 
can be the only source of actual cooling; while the second term is the diffusive, or incoherent, contribution resulting 
primarily from amplifier noise and the interaction of a given particle with signals from the roughly N s other particles 
present on average in its sample, and contributes only to heating the beam, as is apparent from its negative sign. 
(The incoherent term typically also includes a contribution from the self-field, in order to account for the tendency 
of particles far out in the tails of the distribution to receive too large of a correcting kick and over-shoot the target 
trajectory.) 

Because the incoherent heating contribution in (JT|) grows faster with amplifier gain than the coherent term, at any 
time there is then some locally optimal value of the gain, 

G(t )~{«} ! (3) 
which maximizes the instantaneous cooling rate then given by 

» \f,%- (4) 

Typically, this locally optimal gain will typically start relatively high when the beam is noisy and the coherent 
corrections are large, and then tend to decrease as the beam cools and approaches an asymptotic distribution of 
finite emittance, in which the cooling and heating terms just balance. Note, however, because the range of possible 
subsequent cooling rates depend on the current particle phase space distribution, which in turn depends on the history 
of past cooling, locally optimal cooling rates do not necessarily result in global optima, i.e., in the fastest possible 
overall cooling achievable in a prescribed time interval. The globally optimal cooling history is an interesting but 
non-trivial problem in nonlinear control theory, left for future research. 



III. REQUIREMENTS ON AND USES FOR FAST STOCHASTIC COOLING 



Both the achievable cooling rates and asymptotic equilibrium emittances therefore depend both on the absolute 
power delivered by the feedback signal, and on the relative power in the usable "coherent" pickup signal from any 
single particle, which contains the phase space information necessary for cooling, as compared to the corrupting 
"incoherent" signal arising from nearby particles or from noise in the amplifier, which actually contributes to heating 
during feed-back. 

In order to increase the cooling rate and typically simultaneously decrease the equilibrium emittance achievable, the 
incoherent term represented by A(t) must therefore be made smaller. From the form of A(t) we can see that fast 
cooling times therefore require that the amplifier noise and effective sample size both be made as small as possible. 
The sample size will scale like 



N s — p b min [ircrl ± , S c ] VqAuj 
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where pb = va i h Lb is the spatial number density of particles in the bunch, ab± is the transverse beam radius, 

is the longitudinal beam length, v = c/3 is the mean longitudinal beam velocity, S c is a measure of the transverse 
coherence area of the kicker fields, and Aw is the limiting bandwidth of the pick-up/amplifier /kicker system. 

Rapid cooling will therefore require relatively low beam densities and high-bandwidth, high-peak but variable-gain, 
low-noise amplifiers. Existing stochastic cooling schemes relying on radio-frequency or microwave technology are 
limited by the O(GHz) bandwidths available for high-gain amplifiers at these frequencies, and typical cooling time- 
scales range from minutes to hours. 

Yet much faster cooling might be necessary or desirable in some situations - for example, in order to achieve ultra-high 
luminosity proton beams, because phase space reduction can be offset by particle losses due to collisions, diffusion, etc, 
in the cooling ring at long time-scales. Typically time-scales for current radio-frequency (RF) stochastic cooling are 
Trf-sc ~ O(10 2 s) or O(10 3 s), while with realistic technology OSC might achieve t osc ~ 0(1 s). For electron beams, 
OSC can also work at lower energies where synchrotron damping is Inefficient. Rapid cooling for any proposed muon 
collider will be essential, because of the finite lifetime of the muons; all stages of particle beam production, collection, 
collimation, acceleration, cooling, and experimentation must take place in only a few lab-frame decay times, which 
for muons at O(10 2 Gev) energies is only 0(1 ms). 

Such ultra-fast stochastic cooling, on microsecond time-scales, would require beam densities much lower than those 
typically achieved for particle beams of useful current with bunch sizes chosen for acceleration in RF structures, and 
would require amplifiers which can achieve high gain over very broad bandwidths with minimal added noise. Some 
beam stretching and subsequent compression can be used to suitably dilute the beam density during cooling and 
restore bunch sizes after cooling, and can be achieved in a stable, essentially reversible manner using conventional 
beam optics, but sufficiently broad gain bandwidths cannot be achieved with existing RF technology. Barring any 
unforeseen breakthroughs, fast cooling will require moving to optical wavelengths, where solid-state lasers amplifiers 
(such as those using Ti:Sapphirc crystals) have achieved high gain over O(THz) bandwidths centered around 0(1 /zm) 
wavelengths. Because of the high gain and high bandwidth achievable, optical stochastic cooling shows great promise, 
but also poses significant technological challenges. At such extremely fast time-scales, the pick-up signal cannot be 
manipulated electronically, but must be suitably collected, controlled, amplified, and re-directed into the kicker for 
feedback entirely through optical means; in order to reduce longitudinal emittance, transverse optical fields must be 
made to effect longitudinal momentum kicks, requiring very high gain; and particle beam optics must control particle 
positions within a fraction of an optical wavelength, presumably through some active monitoring and feedback. These 
pose important questions and difficult challenges, but none in themselves are feared to invalidate the possibility of 
OSC in principle. Here we focus primarily on a fundamental question of principle that has been raised, namely 
whether the wiggler signal from the beam in like regimes of operation contains adequate information to cool quickly 
or even cool at all, or instead whether it may be hopelessly corrupted by quantum "noise" or "fluctuations." 



IV. WHY CONSIDER A MUON BEAM? 



The possibility of a muon collider has received significant attention in the past decade or so. The muon is a fundamental 
particle that has been little studied at high energies in controlled experiments. In a muon-muon collider, lepton 
physics similar to that studied in linear electron-positron colliders might be pursued, but because with a rest mass 
of to m = 105.7 MeV/c 2 , the muon is ~ 207 times heaver than the electron, so synchrotron radiation is comparatively 
suppressed, and muons can be accelerated and stored in circular rings at high energies (~ O(10 2 GeV)), as opposed to 
electrons or positrons which require linear accelerators. The higher mass also suppresses the so-called "beamstrahlung" 
effects which can lead to energy loss and/or energy spread, so larger bunch sizes and higher luminosities arc in principle 
possible, while radiative corrections are smaller. In terms of their potential for particle creation, collisions between 
leptons that seem t behave as point particles are intrinsically more efficient than collision between baryonic particles 
like protons with significant internal structure. The larger mass of the muon also translates into larger cross sections 
for certain interactions, especially for Higgs production, and precise measurements of the muon lifetime or g — 2 factor, 
or searches for a muon electric dipole moment (EDM) or for lcpton-flavor-violating decays offer promising avenues 
upon which to search for signatures of SUSY or other physics beyond the standard model. 

The catch of course is that muons are unstable. They must be created through pion capture, so intense sources are 
expensive and produce initial beams with poor collimation (high transverse emittance) and large energy spread (large 
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Damping rings 




FIG. 2: Schematic of a system for fast transit-time optical stochastic cooling. The stretcher/compressor ring, together with 
the linacs, expands the beam to lower the density before cooling, then reversibly compresses it after cooling. The cooling ring 
may contain one or more cooling sections. 

longitudinal emittance). Even with time dilation in the lab frame corresponding to relativistic factors of 7 ~ O(10 3 ), 
a proper lifetime of only of r M ss 2.2 /is leaves only a matter of milliseconds to collimate, manipulate, cool, accelerate, 
and collide a muon bunch. This might in part be turned to advantage by optimizing the ring not for muon collisions 
as such but instead for the production of a highly-collimated, high-flux, terrestrially-based source of neutrinos via 
spontaneous decay of the muons. 

In either mode of operation, as a muon-collider or a neutrino factory, the ring would pose many severe technological 
challenges, including the need for ultra-fast and intensive cooling, with a characteristic damping time r damp ~ O(10 -6 s) 
or less. Ionization-based rapid transverse cooling, as well as longitudinal cooling through shaped absorbers and/or 
emittance exchange, will be necessary, but transit-time optical stochastic cooling has been proposed as a possible 
means, after the beam is already collimated and highly relativistic, to boost final luminosity beyond that which can 
be achieved by ionization cooling alone, which is limited by multiple scattering and trade-offs between transverse and 
longitudinal emittance effects. 



V. TRANSIT-TIME OPTICAL COOLING 



Zolotorev, et al. [5) |B] have proposed and explored a possible method of ultra- fast transit-time optical stochastic cooling, 
in which both the pick-up and kicker consist of large-field magnetic wigglers; as shown schematically in Fig. [2] In 
the pick-up magnetic wiggler, Lorentz forces will produce transverse quiver motion of the charged beam particles, 
which in turn generates a small amount of spontaneous synchrotron radiation. This wiggler radiation is then collected 
and greatly amplified in a low-noise, solid state optical amplifier system and directed into the second wiggler. While 
the light is being amplified, the particles are directed through a bypass lattice, whose beam optics are designed so 
that each particle receives a time-of-flight delay proportional to the deviation of its longitudinal and/or transverse 
phase-space coordinates from the desired reference orbit. Particles then rejoin the amplified light in the kicker wiggler, 
where they again undergo transverse quivering with nearly the same polarization and at nearly the same frequency 
as the electric field of the optical radiation, resonantly exchanging some amount of energy, with the magnitude and 
sign of the net energy kick depending on the relative phase between particle quiver and field carrier oscillation. 

Actual cooling can be effected only by the interaction of each particle with its own amplified field, and, if the transit- 
time in the bypass lattice is adjusted so that the delay relative to the self-field is proportional to the longitudinal 
momentum deviation, then the interaction in the kicker can produce a restoring force leading to reduced momentum 
spread. If particle time-of-flight is manipulated so as to also depend on a transverse betatron coordinate, then strong 
dispersion in the lattice in the region of the kicker wiggler can also lead to transverse emittance reduction in that 
direction, and alternating the polarization of the wigglers or rotating the transverse phase space of the particle beam 
can then result in cooling the full transverse phase space. 



Cooling is therefore critically sensitive to particle phase relative to the optical signal. Between the pickup and 
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the kicker, particle positions must be carefully controlled, within a fraction of the micron-scale optical wavelength, 
presumably through active feedback on the particle beam optics. As cooling proceeds and deviations from the ideal 
orbit decrease, continued efficient cooling demands that the amplifier gain be decreased, and that the particle beam 
optics be adjusted in the bypass, if not continuously then at least intermittently, so that the probable range of particle 
deviations continues to be mapped into approximately one-half of an optical period in time-of-flight variation. 

Between one cooling pass and the next, efficient cooling demands that the lattice must be designed to provide good 
mixing, or effective "randomization" of particles within each bunch, so that the "incoherent" signal arising from the 
sample particles neighboring any given particle is really incoherent, becoming effectively randomized for each cooling 
kick. Because of the intrinsically low signal-to-noise ratios (SNRs), stochastic cooling can really only work at all 
because the coherent drift term acts during every pass as a restoring force on average, tending to kick each particle 
towards the desired reference orbit on every pass, while the heating terms, although always present, ideally are more 
or less random from pass to pass, and therefore contribute in a diffusive fashion, with no bias in a particular direction, 
and with a net standard deviation accumulating only in proportion to the square root of the number of kicks. 

However, if the net effect from all the sample particles is not independent from kick to kick, so a particle can experience 
a substantially similar "incoherent" signal for several passes, or even even partially correlated heating kicks in the 
same direction over many passes, then the resulting heating can become super-diffusive, and cooling can be greatly 
slowed or even suppressed altogether. 

Any given particle will be subject to the heating effects due to signals primarily from its neighboring sample of particles, 
so the size of the heating signal will scale inversely with the bandwidth of the cooling system. In most traditional 
stochastic cooling schemes, good mixing essentially requires that the identity of particles in a given particle's sample 
be randomized between each pass, because the heating signal may depend strongly on particle degrees-of-freedom that 
do not change significantly from one pass to the next. To shuffle the make-up of each sample between each cooling 
pass, each particle should be shifted in position by a distance comparable to a sample length or more, between exiting 
the kicker and traveling to the next passage through the pickup, in some manner that does not lead to persistent 
correlations between longitudinal particle position and the DOFs undergoing cooling. 

For the high-bandwidth, transit-time OSC method, this sample length is typically very short compared to the beam 
dimensions, about L s ~ N u Xq |65j so this kind of mixing should be even easier to achieve than in RF schemes. 

However, this amount of mixing is really more than is needed here. With transit-time optical stochastic cooling applied 
to highly relativistic beams of already moderately low emittance and energy spread, each particle will produce wiggler 
radiation of roughly the same envelope shape and spectrum, just with a different overall phase. The heating term for 
any given particle consists essentially of a non-stationary shot noise, the superposition of about N s , nearly-identical 
wave-packets of duration N u Xq, but with essentially random phases. Since the phase-space information useful for 
cooling is also encoded in the phases of the signals, good mixing in this context does not really require that particles 
move between samples between kicker and pickup, but can be achieved merely by assuring that longitudinal particle 
positions within a sample are effectively randomized, or equivalently that the relative distances between particles shift 
on the order of one optical wavelength Ao, or perhaps a little more, in a suitably "random" fashion, so as to randomize 
the phases of the approximately N s wave-packets making up the incoherent kick signal in a given particle's sample. 
By "random," we here mean that, ideally, the shifts should be largely uncorrelated with relative longitudinal positions 
before the kick, but can actually depend on other degrees-of-freedom, even ones that might be actively cooled, such 
as transverse betatron coordinates or even longitudinal momentum. Because Ao is so short compared to the stretches 
of beam line between cooling sections available for mixing (typically O(10 m)), this level of sample mixing should not 
be difficult to achieve, and in fact the greater challenge will be to suppress unwanted mixing on these optical length- 
scales between the pickup and corresponding kicker. (Actually, if the longitudinal positions are initially uncorrelated 
with beam energy or betatron deviations, then the time-of-flight delays purposefully introduced in the bypass lattice 
between pickup and kicker during a single pass themselves can provide much of the needed mixing for the next pass.) 

In order to sufficiently reduce the incoherent heating effects from neighboring particles, beam density is reversibly 
lowered before cooling, and then restored following cooling. Even with high-bandwidth optical amplifiers and therefore 
relatively short sample lengths, typical densities for particle beams of interest are still much too large for microsecond- 
scale cooling with conceivable amplifier powers. RF acceleration requires reasonably short bunch lengths (0(10 cm) or 
so) for proper phasing, and high luminosity requires large bunch charge, Nf, ~ O(10 9 ) or even higher if possible, so each 
bunch will enter the cooling section with rather high linear charge density, leading to unacceptably large sample sizes, 
on the order of N s ~ O(10 4 ) or greater. So before cooling, each bunch emerging from the acceleration sections must 
be greatly stretched, from the order of tens of centimeters to a few hundred meters, so that so that cooling can take 
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place at lower particle density and smaller sample sizes, say N s ~ O(10 2 ) After cooling, the bunch is de-compressed in 
order to restore luminosity. The reversible compression and stretching is effected using a linear accelerator (LINAC) 
and a specially designed lattice (see Fig. |2j. Because the beam is highly relativistic, all beam particles travel at 
almost the same nearly- luminal velocity, but with a distribution of relativistic momenta, so drift in free space will not 
efficiently expand the beam, but instead the beam may be first stretched by transport through a ring with very high 
momentum compaction factor, such that particles with greater longitudinal momentum will travel along longer orbits 
and lag behind those with less momentum. Assuming such stretching separately conserves longitudinal action (clearly 
something of an approximation, since the compaction ring must correlate transverse coordinates with longitudinal 
momentum), this corresponds to a simple symplectic rotation in the energy-duration plane of beam phase space, so 
as it increases the beam length, it also introduces a head-to-tail energy chirp, i.e., a correlation between longitudinal 
intra-bunch longitudinal particle position and energy, and as an added bonus, decreases the effective range of relative 
energy spread, having effectively transformed part of the random energy variation into an ordered energy correlation. 
The chirp would tend to impede particle mixing between cooling passes, and so is removed by applying a suitably 
ramped current in an induction LINAC before the beam passes into the cooling ring proper. The beam stretching 
and energy compression is essentially completely Hamiltonian, analogous to the adiabatic expansion of a gas, and 
can be reversed after cooling, when a complementary energy chirp is introduced in the beam by another LINAC, 
and the beam is de-compressed to its original length (ideally in the same compaction ring), but now hopefully with 
reduced emittance and higher luminosity. For the muon cooling scenario, the stretching and compression phases would 
consume approximately half of the allotted cooling time, i.e., typically a few to several microseconds, but the benefits 
of beam-stretching more than make up for this extra time by increasing the achievable cooling rate via reduction in 
sample size and reduction in the energy variation that must be addressed by the cooling sections. 



VI. SPONTANEOUS WIGGLER RADIATION 



The detailed dynamics of a transit-time optical cooling system will depend on the physics and form of the spontaneous 
wiggler radiation emitted in the pickups. The central wavelength Ao of the wiggler radiation is downshifted from the 
wiggler period X u itself by relativistic effects; for a planar wiggler, 



27o 2 V + 2 



Ao«S(l + ^)> (6) 



where a u — fc^a is the wiggler parameter, k u = 2tt/X u , is the lab-frame wiggler wavenumber, B u is the peak wiggler 
magnetic field, jQiric 2 is the average energy of a beam particle and q is its charge. This can be viewed as a resonance 
condition, such that the radiation slips ahead of the sources by one optical wavelength in the time the charges advance 
by one wiggler wavelength. 

Given the average beam energy, the wiggler parameter and wiggler period are chosen to approximately match the 
resonant radiation wavelength Ao |66j to the center of the gain bandwidth for the solid-state laser amplifiers, which is 
typically 0(1 /mi). The homogeneous "coherent" bandwidth [7] of such wiggler radiation is given by 

Ao, » A-c*. (7) 

where lo = ck = 2ttc/Xq is the central radiation frequency and N u is the number of undulator periods in the pickup 
wiggler. This just follows from the Fourier-Heisenberg uncertainty product, since each particle will obviously radiate 
almost exactly N u periods of radiation. 

In the so-called undulator limit, where the angular deflection of particles due to their induced quiver motion is small 
compared to the characteristic opening angle for synchrotron radiation, i.e.a n < Jq 1 , the so-called coherent component 
or coherent mode [67j of the radiation corresponds to a nearly diffraction- limited beam with angular spread 

— 1/2 

smaller than characteristic synchrotron radiation opening angle by a factor of 0(N U ), and with spot size (i.e., 
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optical beam waist) 
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satisfying the optical uncertainty principle for transverse degrees-of-freedom (DOFs). 



Assuming that the light fields in the kicker remain spatially coherent over the transverse extent of the particle beam, 
and the amplifier bandwidth Auja centered on uja is matched to the coherent bandwidth Alu of spontaneous radiation, 
centered on loq, the effective sample size N s , meaning the average number of particles contributing appreciably to the 
incoherent heating signal experienced by any other particle, will scale like: 



(10) 



where rn, — Pb&b± is the average number line density of particles in the bunch, and L s ~ N u Xq is the sample length 
length [HS] over which particles will affect their neighbors. For fast cooling, the beam must be sufficiently stretched 
so that N s is quite small compared to conventional stochastic cooling regimes, i.e.., N s < O(10 2 ) or so. 

In addition, the undulator magnets themselves will be optimized quite differently than for conventional light-source 
applications. Most applications of wiggler radiation benefit from high coherence (narrow-bandwidth) spectra, and 
therefore rely on moderate (in the context of particle physics) energy beams traveling through wigglers of moderate 
wavelength but with many periods. In optical stochastic cooling, it is desirable to produce very broad bandwidth 
optical radiation from extremely relativistic beams, so the wigglers will consist of a relatively small number (N u ~ 
O(10)) of long-period (A„ ~ O(50 cm)) magnets with field strengths essentially as high as is practical (B u ~ O(10 T)). 

From classical radiation theory and Planck's law, the average number of photons emitted per particle into the coherent 
component of undulator radiation produced in a planar wiggler may be roughly approximated as 
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where Je(x) is the £th-order ordinary Bessel function, and a = §- 



is the fine structure constant. 



While the total energy radiated per particle as determined by the Larmor formula will of course be proportional to 
the number N u of wiggler periods, the power radiated into the coherent mode remains constant (or nearly so) as N u 
varies, because the coherent bandwidth scales inversely with N u . The remaining energy is radiated too far off-axis or 
at frequencies too far from the fundamental to be particularly useful for OSC. Actually, equation (111 is not entirely 
accurate for small-period wigglers where N u < O(10), but in any case, for Ao ~ 0(1 //m), N u ~ O(10j^ and achievable 
magnetic field strengths corresponding to a u ~ O(l), the number of photons emitted per particle into the coherent 
mode should still be O(a). 



VII. "NAIVE" QUANTUM MECHANICAL CONSIDERATIONS 



So optical stochastic cooling poses serious technological challenges, at least for the very fast cooling required for 
muons, but in pondering the possibilities for such fast cooling based on wiggler radiation, serious concern arose over 
possible fundamental rather than merely practical limitations of this scheme. The apparent problem is that the actual 
cooling arises only through the interaction of each particle with its own wiggler radiation, which as we have just seen, 
is extremely weak before amplification. 

In any one pass through a pickup, each particle only radiates on average O(a) ~ 10 -2 photons that can be collected, 
amplified, and fed back to actually effect cooling, so the optical cooling signal from each particle will very weak and 
presumably may be subject to significant quantum mechanical effects. Yet naive quantum mechanical considerations 
then raise fundamental doubts as to whether the pickup signal even contains the phase information needed for transit- 
time cooling, whether this information can be reliably amplified and extracted, and whether quantum fluctuations 
in the incoherent signal from neighboring particles or arising in the optical amplifier itself lead to more significant 
heating than is accounted for classically. 
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A. Quantum Cooling Catastrophes? 



With so few photons on average in the relevant cooling component of the pickup signal for any given particle, simple- 
minded quantum mechanical thinking suggests possible quantum catastrophes for cooling. These lines of argument, 
while ultimately flawed, truly raised concerns and engendered debate, and are not merely straw men erected only to 
be demolished by more careful analysis. 



If, as in the standard treatment by Sands [5] of synchrotron radiation damping in electron storage rings, particles are 
assumed to radiate independently and at random, in a series of discrete, stochastic "quantum jumps" corresponding 
to Poissonian emission of a whole number of photons, at some average rate but at random times, and if the amplifier 
is imagined to faithfully multiply whatever photons are emitted, perhaps with the addition of some extra randomly- 
phased noise photons due to spontaneous emission or thermal noise [69], then on average any one particle emits a 
photon only once in every 0(a~ 1 ) passes through the pickup, while the neighboring particles in its sample emit on 
average a total of 0(N s a) photons, give or take O U/N^a\ . So for N s > O(10 2 ), one or more photons within a 
coherence length, randomly phased with respect to the particle in question will likely be present in the pickup signal 
and be amplified on every pass. It might seem that each particle will be subject to an appreciable heating kick on 
each pass, but usually experience no cooling kick whatsoever on most passes, but then every 0(a^ 1 ) turns or so, 
suddenly receive a large cooling kick, comparable in magnitude to the typical heating kick per pass. Such stochastic 
discreteness effects might be expected to drastically lower the cooling rate compared to that calculated classically, or 
possibly even lead to unstable feedback preventing or suppressing cooling altogether. 



Because the sample size N s is quite small compared to that in most conventional stochastic cooling schemes, and 
the coherent signal intrinsically small, it also seems possible that the fluctuations in the pickup signal may no longer 
be dominated by the classical shot noise associated with random particle positions within the beam, but instead or 
additionally include quantum fluctuations of some sort. In particular, if the radiation emission from each particle 
consists of the occasional random emission of a photon in a "quantum jump" at some random time while the particle 
undulates in the pickup wiggler, then the phase of these photons would be expected to be very poorly determined. 
Even if the photon is perhaps more realistically considered to be emitted over some finite formation length, presumably 
the wiggler length itself in the lab frame, in the relevant regime of small emission probability per pass, this might 
seemingly make the phase uncertainty worse, not better. But in transit-time cooling schemes, the particle phase-space 
information used for cooling is encoded almost exclusively in the phase of the signal, so that even if the self-field is 
present, it seems that it might not carry the phase information necessary for transit-time cooling to work. Because 
the average photon number emitted by any one particle is small, the variation or uncertainty in this number is also 
small in absolute terms, and the "number-phase" Heisenberg uncertainty principle, 



1. Do individual particles radiate at random in "quantum jumps" ? 



2. When individual particles do radiate, is the phase even well-defined? 



ANAcj) > 



(12) 



suggests that the phase [70] of any emitted photons is very poorly determined. If photon emission is assumed to be 
at least approximately Poissonian, then AN w N 1 / 2 , where "N is the average number of photons; but if j\f ~ 0(a), the 
phase uncertainty must satisfy 




(13) 



But if essentially all of the particle phase-space information useful for cooling is contained in the optical phase of the 
pickup radiation, and this is unavailable, cooling cannot occur. 
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3. What of spontaneous emission or thermal noise in the amplifier? 



In addition to the discreteness effects in photon-emission and photon-phase noise, the amplifier is expected to add 
the equivalent of one or more photons to the pre-amplified signal due to unavoidable spontaneous emission within 
the active gain medium, and perhaps as well some thermal or other additional noise photons. These photons will 
be randomly-phased, so this source of noise acts more or less like having some extra number of particles N n in the 
sample, in addition to, and for our parameter regime comparable or greater to, the actual number N s . Therefore, if 
the cooling does work despite our concerns addressed above , this may thereby affect the cooling rate in degree, but 
should not quell cooling altogether. 



4- Stochastic Cooling in the Quantum- Jump Model 



Fortunately, these fears of quantum effects catastrophically slowing or suppressing cooling will turn out to be mis- 
guided; the only source of quantum noise actually present is of the final sort mentioned: additive amplifier noise. But 
it will be informative to incorporate these various considerations into very approximate quantitative estimates for 
the longitudinal (or rather, energy-spread) cooling rates with various naive quantum effects incrementally included 
for comparison, in order to make this intuitive reasoning more precise and to better understand later where it goes 
wrong. 

The skeptical reader may again suspect that these arguments are introduced disingenuously, but they accurately 
represent the very concerns that motivated this investigation, and with some basis. As mentioned previously, a 
very similar discrete model of radiation by photon emission was used, and by all appearances quite successfully, by 
Sands [H] to treat radiation damping in a synchrotron ring, where quantum fluctuations offset damping, leading to 
finite equilibrium emittances, and is also commonly used to analyze laser cooling of particle beams by Thomson 
scattering [§]. 

The energy kick given to the jth particle on a single pass is 



rnc^A-fj — q J dtE(xj(t),t) -Vj(t), 



(14) 



where E is the transverse optical electric field seen by the particle in the kicker, Xj(t) is the particle's quivering spatial 
trajectory, governed predominately by the large magnetic field of the plane-polarized kicker wiggler, vrt) = c(3j(t) — 
4fx(i) is its velocity, tj is its arrival time at the front of the kicker wiggler, and Atj is the time spent inside. 

As we are interested in a rough comparative scaling, we will use a simplified "back-of-the-envelope" model of longi- 
tudinal cooling where we assume all fields and trajectories oscillate sinusoidally, neglect end effects in the wigglers, 
diffractive and other transverse variation in the fields, dispersion and nonuniform gain in the amplifier, and vari- 
ous other details, consider beam particles to be highly relativistic (i.e., -fj ^> 1) but already relatively cold, (i.e., 
\jj — 7o| <C 7o), and assume the static wiggler fields are large compared to radiation fields both before and after 
amplification. Keeping only the lowest order contributions in 7 _1 , the (normalized) energy kick per pass can then be 
estimated as 



A 7j 



qa u N u X u 
2mc 2 /3 7o 



(15) 



where PqC and mc 2 jo are the average beam velocity and energy, respectively; Ej is the amplitude of the wiggler 
field produced by particle j before amplification, assumed to be a sinusoidal plane wave with a rectangular envelope 
corresponding to exactly N u periods in the pickup wiggler; 4>ij is the relative phase delay or advance between the 
transverse quiver velocity of particle i and the amplified pickup radiation from particle j at the entrance to the 
kicker, as determined by their separation in the pickup and the bypass beam optics; G is the power gain of the 
amplifier, assumed constant over the relevant bandwidth for wiggler radiation, and rjj is a stochastic variable with 
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vanishing mean, representing the additional amplifier noise arising from thermal effects and/or spontaneous emission. 
We have supposed that the spatial trajectory of each particle is classical, determined by the wiggler fields and initial 
conditions upon entering the cooling system, i.e., neither space charge nor other collective effects, nor recoil or multiple 
scattering effects from the radiation fields, will appreciably perturb the spatial trajectories of the particles on any one 
pass, although by design the amplified radiation fields will perturbs the energy of the particle while in the pickup. The 
sum in (151 is taken over some effective number N s particles in particle j's sample, where to compensate for neglecting 



the assumed finite temporal extent of each particle's pickup radiation, such that any one particle is subjected to only 
part of the field arising from a neighboring particle while quivering in the kicker wiggler, this effective sample size will 
be somewhat smaller (i.e., by some numerical factor of O(l)) than the actual number of particles within a coherence 
length N u Xq of the given particle. We have also assumed that after amplification the optical field can be treated 
classically, although it can retain the (amplified) stochastic fluctuations arising from its quantum origins. 



Excluding phase noise in the radiated fields, ideally the phase-delay <pjj is arranged through appropriate choice of 
bypass optics to be a function, ideally a nearly linear function, of the deviation Sjj in particle energy: 



(16) 



where S^/j = jj — 70 is the (normalized) energy deviation of the jth particle from the reference orbit, and the sign 
of /ii is chosen so that the coherent signal on average provides a restoring force, nudging the jth particle toward 
the reference energy. (If transverse phase space is also to be cooled, then the phase delay will have contributions 
proportional to the betatron errors as well. This is neglected here.) 



The longitudinal cooling rate, or inverse cooling time-scale r c , 
RMS energy deviations in the beam are damped: 



may be defined as the instantaneous rate at which 



.I|log(5 7 |)«-i/ c 



(A 1 ])+2(5 lj A lj ) 



(17) 



where f c is the frequency of passes through the cooling system(s) , Ajj is the cooling kick as given by ( 15 1 , and averages 



are performed over the current phase space distribution of the particles in the beam, and over any stochasticity in 
their radiation emission as well as over any noise from the amplifier. 



Assuming purely classical emission, the self-field amplitudes Ej and phases 4>jj are deterministic functions of the 
single particle energy, (itself known only in a statistical or ensemble sense), while the phase differences <pij for i =/= j 
are determined by shot noise describing the random particle positions within the beam. Further supposing the time- 
of-fiight delays and amplifier gain are chosen to be locally optimal at all times during the cooling (not necessarily 
leading to the globally optimal minimum cooling time), and for simplicity taking particle energy deviations to be 
Gaussian in the lab frame, and also assuming (N s + N n ) > 1, the longitudinal cooling rate becomes approximately 



2 1 

e 5 (N B +N n ) 2 



+ (N a + N n ) + 



4^ 1 
e 4 (N s +N n ) 



(18) 



where N n is proportional to the noise power (rj 2 ) in the amplifier, expressed in terms of an equivalent number of extra 

sample particles, and we have neglected corrections of order O ( ^ N y 3 J or higher. Classically, N n cannot vanish, 

but it could in principle be made arbitrarily small by sufficiently strong pumping and simultaneously cooling of the 
amplifier. 



Instead, if we incorporate our naive quantum mechanical intuitions about emission fluctuations by imagining that the 
particles emit discrete photons in a Poissonian fashion, but continue for the moment to ignore Heisenberg phase noise, 
then <j>jj is still regarded as deterministic (in the sense explained above) but E? may be taken to be proportional 
to a Poissonian random variable with some probability p rad ~ a of photon emission per pass. Since p rad <C 1 in our 
regime, we can approximate each Poissonian random variable by a binomial variable, effectively neglecting the very 
rare emission of two or more photons, and we find after some algebra that in this case the optimized cooling rate 
becomes 



Pr.d fc 



e 5 (N 3 +N n ) 2 



+ (N s + N n ) + 



_4_ 1 

e 4 (N a +N n ) 



(19) 
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which as expected is indeed slower by a factor of p lad ~ O (a" 1 ) than the fully classical prediction. This drastic slow- 
down occurs, of course, because the coherent cooling signal is by assumption only present on average in a fraction 
0(p laA ) ~ 0(a) of passes through the cooling section. 

While throughout our simple analysis we have assumed linearity of the cooling kicks in the fields and particle devia- 
tions, in which case stochastic cooling can work at some rate regardless of the signal-to- noise-ratio (SNR), nonlinear 
effects might lead to instabilities beyond some finite range of SNRs, so it is also possible that this large multiplicative 
noise might not just drastically slow down cooling but frustrate cooling altogether. 

Next, to incorporate the phase noise in addition to the Poissonian emission into our model, it seems natural to also 
treat the <j>jj as random variables with conditional means determined as above, but subject to random fluctuations, 
ostensibly with RMS deviations 

<^-> 1/2 ~ vfe ~ ° ( a " 1/2 ) ' (20) 

which are comparable to 2tt. Taking these assumed phase fluctuations to be Gaussian for simplicity, the cooling rate 
becomes approximately 

t- 1 « Iw/ce-<**«> ? „ ; -, (21) 

1 - e - 2 (W?i>+ 1 ) + (N s + N n ) 

to leading order in the small quantities e~^^j^ and N . With such phase uncertainty, the cooing rate would be 

suppressed by more than a factor of e~^n^ ~ O(10~ 3 ), because the cooling information carried by the phase of the 
self-fields has been corrupted with intrinsically quantum noise. 

Finally, quantum mechanics will enforce a lower bound for the amplifier noise N n w \(ri(t) 2 ) due to unavoidable 
spontaneous emission in the pumped medium responsible for the gain. Various quantum mechanical and semi-classical 
arguments (see for example, [10 for a survey) suggest a minimum amount of added noise equivalent in its final effects 
to one-half photon per mode entering the amplifier along with any actual signal present and amplified along with it. 
With our simple model of windowed plane waves, and an effective interaction length in the kicker (accounting for 
slippage between the particles and fields) equal to the coherence length N u \ of the radiation, each particle effectively 
interacts with a single mode, so that one would predict that at best N n ~ f ~ 0(a^ 1 ). While not appearing as 
a multiplicative slow-down in the cooling rate and therefore not as devastating as the other possible quantum effects 
described above, such spontaneous emission noise does indicate that for small sample sizes quantum fluctuations may 
be as important a contribution to the incoherent heating as classical shot noise from actual particles, and that these 
fluctuations will ultimately limit the cooling gains achievable by diluting or stretching the beam. So indeed we can 
we can roughly account for the spontaneous emission by increasing the effective sample size from iV s to N s + N n for 
some effective noise number N n ~ -/V amp /N ph , where N ainp is a measure of the effective added noise by the amplifier 
expressed as an equivalent number of photons at the amplifier front-end (i.e., prior to amplification. ) 



VIII. TOWARDS A MORE CAREFUL TREATMENT OF QUANTUM EFFECTS 



Fortunately, a more careful quantum mechanical analysis will reveal that most of these naive intuitions are incorrect, 
and that in effect only the additive amplifier noise is present, so that cooling rates are accurately estimated by classical 
continuous emission results, provided allowance is made for the unavoidable amplifier noise, which is ultimately 
quantum mechanical in origin, and cannot be made arbitrarily small without violating the Heiscnbcrg Uncertainty 
Principle or the unitary nature of quantum dynamics. That is, in a more careful quantum mechanical analysis neither 
the multiplicative emission- noise leading to the 0(a) slow-down in the naive Poissonian-emission model (191, nor 
the catastrophic slowdown appearing in the naive phase-noise model (211 actually occurs; in effect, only additive 
noise arising from spontaneous emission or thermal effects in the amplifier appears, leading to a cooling rate of the 
approximate form (18 1 for some effective noise number N n , whose minimum value is essentially constrained by the 
uncertainty princip 



A more rigorous treatment of OSC will force us to examine carefully each stage of the cooling dynamics: the particle 
motion in the pickup wiggler, the resulting radiation emitted, the quantum mechanics of the optical amplification 
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process, the particle-radiation interaction in the kicker wigglcr, and the resulting changes in the beam phase-space 
distribution. Because we are here interested in a proof-of-principle question rather than a detailed design assessment, 
we will make a number of simplifying assumptions, yet still incorporate the essential classical, quantum, and statistical 
physics of the processes so as to address the fundamental question of whether optical stochastic cooling based on 
amplification of small pickup signals is intrinsically flawed due to the effects of quantum noise, and ultimately arrive 
at a less pessimistic answer than what we arrived at above. 

A fully self-consistent, quantum mechanical (or worse, QED-based) treatment of beam particles, radiation fields, 
amplifiers and other optical elements would be prohibitively difficult, but fortunately is not actually necessary either. 
Rather than explaining the quantum mechanical features of the dynamics of the beam particles, we will in effect 
carefully explain them away, arguing that particles in the beam can be treated classically in their interaction with the 
wiggler, radiation, and any external focusing fields. 

Then we will verify that such particles do not radiate into photon number states, but rather Glauber coherent states, 
which are actually the states closest to classical radiation fields allowed by quantum mechanics, and quite different in 
their statistics from the states consisting of whole numbers of photons. Making certain idealizations, The radiation 
essentially retains this form as it passes through dielectric optic elements. 

Of course, the inverted population of atoms in the active medium of a laser amplifier behaves in a highly non-classical 
manner, but the precise dynamics of the amplifier need not be evolved explicitly. Very general considerations of 
amplifier action will be sufficient to determine the action of the amplifier on the pickup field in an "input-output" 
formalism where the specifics of the intermediate dynamics can be neglected, and to characterize the best-case limits 
on the additional noise introduced of the amplification process without resorting to an explicit microscopic model of 
the amplifying medium. 

Once amplified, the radiation behaves entirely like a classical but noisy field, both in its statistics and its interaction 
with the beam particles in the kicker. 

So once these results are established, a simple estimate of the cooling rates can be made essentially along classical 
lines, just accounting for any extra noise in the field due to amplified spontaneous emission. 



IX. PARTICLE DYNAMICS ARE CLASSICAL 



For beams of electrons, muons, or protons, at relevant energies and emittances, and with realistic wiggler strengths, 
the de Broglie wavelengths associated with the particles' longitudinal and transverse motion are extremely small 
compared to the radiation wavelength, wiggler period, beam dimensions, and other relevant scales, so we will argue 
that the particles can be treated as classical point particles obeying classical relativistic kinematics. The subsequent 
analysis of the radiation will be vastly simplified by assuming the dynamics of the particles in the pickup wiggler to 
be classical, and in fact with prescribed classical trajectories determined by external fields only, so the effort needed 
to carefully establish and justify the accuracy of these assumptions will be subsequently rewarded. 

Clearly in order to adequately describe particles classically, all statistical and dynamical manifestations of quantum- 
mechanical or quantum-electrodynamical effects on the particle DOFs must be negligible. Often, statistical and 
dynamical effects are not clearly distinguished, but it not difficult to find regimes where cither the classical statistical 
limit or classical dynamical limits may be valid, but not both. However, it is often the case for a system consisting 
of many particles that classical statistical noise can also tend to mask the dynamical manifestations of quantum 
noise. Both limits of course are related in part to the size of the typical de Broglie wavelength X^b associated with a 
quantum mechanical particle, but ignoring quantum statistical effects typically requires that Xjb be small compared 
to average inter-particle spacings, while ignoring quantum dynamical effects typically requires that XdB remain small 
compared to all relevant dynamical length scales. For example, in a plasma or ionized gas at rest (on average), if 
^dB ~ 7 \ rp <C n~3 , where n is the particle number density, T is the temperature, and k B is Boltzmann's constant, 
then quantum statistical effects can be ignored and the particles can be taken to satisfy Maxwell-Boltzmann statistics, 
as if they were classical point particles, whether they are identical fermions or bosons. But unless XdB "C £ L where 

2 

~ jr^f is the Landau length, or typical distance of closest approach between two plasma particles, then the Coulomb 
scattering between particles should be treated quantum mechanically. 
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A. Quantum Statistical Degeneracy 



In the case at hand of a relativistic, charged particle beam of indistinguishable fermions, the safe neglect of quantum 
statistical effects requires that the beam be non-degenerate in the average rest frame: 



where Aj 



(p / 6)- | »A' dB , (22) 
is the typical thermal dcBoglic wavelength, T" is an effective temperature, which may be different 



dB Vmfc B T' 

for the longitudinal and transverse DOFs, and m is the invariant particle rest mass. We will assume throughout that 
the average beam energy mc 2 7 and beam energy spread mc 2 8^ satisfy 7 » 1 and y « 1, even before stochastic 
cooling is applied. Since a u ~ O(l), this also implies that the longitudinal momentum p inside the wiggler is only 
negligibly smaller than the average beam momentum p = mc~//3 outside the wiggler. Some of the longitudinal beam 
kinetic energy is converted into energy of transverse quiver in the wiggler magnetic field, but assuming that transverse 
canonical momentum is conserved in the presumed transversely-uniform wiggler fields, and that particles enter the 
wiggler approximately on axis, one finds 



while by assumption (3 ~ 1 and 



p 4 /3 2 7 2 ' 

<C 1, so we will ignore this distinction when convenient. 



(23) 



Since motion in the wiggler field will not significantly effect the beam temperature, average density, or energy, for 
simplicity we can analyze the beam while it drifts freely, prior to its entering the pickup wiggler. In this limit of high 
energy and relatively low energy spread, one finds from relativistic velocity addition that the rest-frame longitudinal 
temperature can be expressed as 



k B T,\ ~ mc (3, 



2 all 



1 



mc 



/3V 



(24) 



where f3c is the average beam velocity in the lab frame, with <~ 1, and £7 w ^ is the lab-frame beam energy 
spread. Since momenta perpendicular to a Lorentz boost remain unchanged, the transverse rest-frame temperature 
can be written as 



k B Tj_ — 



(p'l) „ Spl 



2m 



m 



(25) 



where Sp± is the root-mean-squarer transverse momentum spread in the lab frame, related to the conventional 
transverse beam emittance ej_ ~ <7 r ^^ ± , where <r± is the transverse (radial) beam size. From Lorentz contraction, it 
follows that the particle density must transform as n' b = rib/ J, so the conditions for non-degeneracy, written in terms 
of lab-frame quantities, become 



Pb < 7 



1 

a! 



#7 


3 




. 7 _ 







1_ 

~3 ' 



and 



Pb < 7 



5_A 
h 3 



a! 



mc 



(26) 



(27) 



where A c = ^ the Compton wavelength. For achievable beam densities and emittances, even after cooling, these 
conditions are typically easily achieved by many orders of magnitude; existing particle beams are extremely far from 
being degenerate, and quantum statistical effects arising from the Pauli Exclusion Principle can be completely ignored. 



B. Pair Creation or other QED Effects 



Although the particles are assumed to be highly relativistic in the lab frame, they are all streaming in the same general 
direction, with small relative variations in momentum, so particle-scattering will involve small center-of-momentum 
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energies, and hence particle/anti-particle pair creation or other exotic QED scattering effects should be negligible. In 
the average rest frame, this requires 

mc 2 ((3' 2 ) < 2mc 2 , (28) 
equivalent in the lab frame (after dropping some factors of two) to the requirement that 

— « 1, (29) 

7 

which has already been assumed. Of course, muons will spontaneously decay in a random, Poissonian fashion, which 
is a completely non-classical process mediated by electroweak interactions, but the resulting electrons will be lost from 
the beam as soon as it is bent in magnetic fields calibrated for the heavier muons, so this process can be ignored for 
particles which remain in the beam throughout the full cooling process, requiring several turns in the cooling ring. 



C. Spin Effects 



Since spin degrees of freedom are intrinsically quantum mechanical, at least for apparently non-composite particles 
such as muons or electrons [7T], all spin effects should be negligible in a truly classical treatment. The energy 
associated with the spin of an elementary fermion in the wiggler field B u is U sp i n ~ ^B„ = ^a u hk u c. In order 
to safely neglect spin effects, this energy U sp i n should be smaller than all other relevant energy scales, including: 
the total particle particle energy 7771c 2 ; particle quiver kinetic energy, which for 7 1 and a u ~ 0(1) is about 



c— 
7 



2 



|mc 2 — ; the mean photon energy Hluq; and the total radiated energy per particle, U ra d, 

z 7 

which is larger by about 0(N U ) than the power radiated into the coherent mode. Using the relativistic Larmor 
formula, the total radiated energy per particle in the pickup wiggler can be estimated as 

tt p A + 2 q 2 dp» dpy N U X U q 2 4h2 N u X u 2 

U r ad = Prad&t = 77 o q ~j ~ ~~ 7 P ~ aN u a u hw . (30) 

3 vnrc? dr dr c c c 

These requirements imply, respectively, that 

7 3 

hu> <C mc 2 — ; (31a) 
a u 

huo <C 7a u mc 2 ; (31b) 
7 2 > a u ; (31c) 

7 2 » — !— ■ (31d) 

a]y u a u 

Since hojQ = j^mc 2 , the first two conditions can be written equivalently as A c -C 7^ Ao and A c -C 7a„Ao. For optical 
or near infrared frequencies, a wiggler parameter a u ~ 0(1), N u ~ 0(10) wiggler periods, and 7 ~ O(10 2 ) or O(10 3 ), 
these conditions are all easily satisfied. 



To be thorough, we should examine not just energies but forces associated with spin DOFs. As pointed out by Bohr 
and Mott as early as 1929 |11], consistency of the Correspondence Principle demands that any non-composite particle 
whose spatial DOFs are behaving classically should also exhibit no observable spin effects, because the magnitude 
of the spin and associated action is bounded, and there is therefore no way to consider its classical (high-action) 
limit independently from that of the spatial DOFs. Evidently, this suppression of spin effects must be enforced by a 
"conspiracy" between the Hciscnberg Uncertainty Principle and the Maxwell-Lorentz equations [72;. In particular, for 
charged elementary fermions, the magnetic field produced by the spin of one particle and felt by another particle will 
be swamped by the magnetic field produced by the moving charge associated with the first particle. At a characteristic 
distance r' ~ n /— 3, in the average rest frame, the magnetic field due to one spin is roughly -B' pin ~ ^mcr^' while 
the magnetic field due to the same particle's motion will scale like B' v ~ ^r^n- If the uncertainty in position Sr' 
satisfies Sr' -C r' , consistent with the particle's spatial DOFs behaving classically, then it can easily be shown that 
B' spin <C 6B' V ,, where 5B' V , is the uncertainty in the Biot-Savart field B' v , due to the Heisenberg uncertainty in the 
velocity v' . Therefore, spin-spin effects will be negligible in a non-degenerate beam of charged leptons. 
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Although ideally the magnetic field of the wiggler is taken to be transverse, transversely uniform, and plane polarized, 
such spatial variation is not strictly consistent with Maxwell's equations, which demand that the wiggler magnetic 
fields are both curl-free and divergence-free in the neighborhood of the beam line, requiring in addition to the principle 
sinusoidal transverse component at least some small longitudinal component and some transverse gradients with scale- 
length O(Ao) in the neighborhood of the beam axis. With such self-consistent field profiles, it can be directly shown 
that the transverse component of the dipole force on the spin, given by, Fd p — 2^(s ■ V)£? M , will be small compared 
to the transverse component of total Lorentz force on the charge, which is given by Fl\v x B u , if 

Sv± <C v z , (32a) 
hk < 7 2 /3mc, (32b) 

where Sv± w ^r^-P-L is the transverse velocity spread, and fco = ^ is the central optical wavenumber; while the 
longitudinal dipole force on the spin will be negligible compared to the longitudinal component of the Lorentz force, 
provided 

huj <C 7mc 2 , (33a) 

-Nla u \ c < A , (33b) 

7 

hk < — — 7 3 /3mc, (33c) 
all readily satisfied. Spin degrees-of-frccdom arc here unimportant to all aspects of the particle dynamics. 



D. Transverse Motion 



Next, we argue that quantum mechanical effects in the transverse motion are unimportant. In order that this be the 
case, the spread in a particle's transverse wavefunction must remain small relative to other relevant length-scales, 
including the transverse beam size, the amplitude of the transverse quiver motion, the range of transverse variation 
of the wiggler field, as well as the optical wavelength. Also, the transverse velocity fluctuations associated with 
quantum mechanical uncertainty demanded by the Heisenberg uncertainty principle must remain small compared to 
the transverse quiver velocity and to the transverse velocity spread in the beam. Neglecting any initial transverse 
particle velocity and any transverse spatial variation in the wiggler fields, conservation of canonical momentum implies 
that the transverse quiver velocity will be v± ~ c 9 ^-. Since we assume ^ C 1, the transverse quiver motion can be 
taken to be non-relativistic, provided we replace the rest mass m everywhere with the "effective" relativistic mass 7m 
to account for the increase in inertia arising from the high longitudinal velocity in the lab frame. Furthermore, we 
will neglect the transverse forces due to the wiggler fields or external beam-optical fields. Because classically these 
fields will provide focusing (at least in one direction) on average, semi-classical considerations suggest that they will 
act mostly to inhibit the spread of the wavefunction, so ignoring them altogether should constitute a conservative 
approximation. 

To get an idea of the scaling, we suppose each particle is described by a Gaussian wavepacket, initially (at t = 0) 
separable in the transverse (x and y) coordinates, and having some minimum initial variance in each component 
of position and momentum consistent with the Uncertainty Principle, but then freely streaming during the total 
interaction time At « the wiggler. The dynamics for the different transverse directions are identical, 

so we need follow only one component, say in the x direction. The variance in position at a subsequent time t will be 
given by 

A* 2 (t) = A* 2 (0) + |^Ax 2 (0) = A* 2 (0) + (34) 
while the variance in momentum, 

Apl(t) = A P 2 (0), (35) 
is constant, since transverse forces have been neglected, and we have further assumed that the initial conditions satisfy 



Az(0)Ap x (0) « ^. 



(36) 
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By differentiating the expression for Ax 2 (t) with respect to Ax 2 (0), one finds that the spatial variance is minimized 
at any subsequent time t > by 

Az 2 W = i^. (37) 

Such a wave-packet with minimal spatial spread is the closest thing to a classical point-particle allowed by quantum 
mechanics, and evaluating the spatial variances at t = At, we have 

Ax 2 (At)~i^. (38) 

In order that the particle behavior remain classical during this interaction time, this variance should be sufficiently 
small in several senses: it is necessary that this transverse spatial spread be small compared to the transverse beam 
dimensions, i.e., that Ax(At) <C cr±, or 

KK « ^-Jo\- (39) 
that it be small compared to the transverse extent of individual particle orbits, i.e., that Ax(At) <C c^^r, or 

A C «^A U ; (40) 

that it be small compared to the transverse range of variation for the wiggler magnetic field, which from Maxwell's 
equations (specifically, VxB = 0) will be comparable to the longitudinal length-scale of field variation, namely X u ; 
i.e., that Ax(At) <A„, or 

A c « ^iK; (41) 

and finally, in order that the particle's phase in the radiation field be well-defined, we require that the phase uncertainty 
associated with this transverse uncertainty in the presence of wavefront curvature remain small: fc <56>Aa; <C 7r, or 
roughly 

A C «|A . (42) 

For sufficiently long wigglers (N u 3> 10), some of these conditions could be violated, but in that case a more sophis- 
ticated analysis including transverse focusing effects would be needed to assess the classicality of the particle orbits. 
In our regime (N u < (9(10)) they are met. 

So far we have followed the conventional reasoning, but we actually need to be more careful, because up to now 
we have only considered spatial spread, but in fact the wave-packet that minimizes transverse spatial variances 
has arbitrarily large variance in transverse momenta as t — > 0, violating the requirement that quantum mechanical 
transverse momenta uncertainties should also be small compared to the transverse momentum spread in the beam and 
to the wiggler-induced quiver momentum. It turns out that with a lengthy calculation for an appropriate Gaussian 
wavepacket, the uncertainties in both transverse position and momentum can be made sufficiently small simultaneously 
provided the above conditions remain satisfied, and in addition 

Xt«4-^, (43a) 
^ < 47ra u , (43b) 

which are readily satisfied. 

Including the effects of the wiggler fields should not change these conclusions, provided the wiggler field itself acts 
classically. In the average rest frame, the wiggler fields can be treated via the Weizsacker- Williams approximation as a 
traveling electromagnetic plane wave consisting of virtual photons, which then scatter off the particles to produce the 
real wiggler radiation. Over the length of the wiggler and the transverse area of the coherent radiation, the number 
of virtual wiggler photons is very large: 

a-X 2 $f »1- (44) 

Actually, even over the much smaller volume defined by the wiggler length, the transverse extent of the particle's 
quiver, and the classical muon radius r^, the number of virtual wiggler photons is still large: 

AT„^»1. (45) 

Additionally, by definition the wiggler field is very coherent, with "quantum" uncertainties in the virtual photon 
number density very small in a relative sense, and uncertainties in phase very small compared to 2tt. One could not 
demand more from what is regarded as a classical field in what is really a quantum mechanical world. 
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E. Longitudinal Dynamics 



In a similar manner, we can show that the longitudinal dynamics can be taken to be classical, although unlike the 
transverse motion, the longitudinal motion is highly rclativistic. For the moment, we will assume that the particles 
arc freely streaming longitudinally, unperturbed by wiggler, space-charge, radiation, or other fields. Because spin, 
ZBW, and other rclativistic quantum effects can be neglected, we can fortunately forego use of the Dirac or even 
Klein-Gordon equations, and simply use a one-dimensional Schrodinger equation with the positive-energy branch of 
the relativistic dispersion relation: 



huj(k) = \J m 2 c 4 + c 2 h 2 k 2 — mc 2 . 
We assume the initial state is a minimum-uncertainty Gaussian wavepacket 

1>(z,t = 0) = 



(46) 



(47) 



where Az{t) 2 is the longitudinal spatial variance at time t, and hk = mc~ff3, is the average longitudinal beam 
momentum. Taking a Fourier transform, this can be written as 



l 



■i k z 



where 



i>{k) 



2Az(0) 



dk ip{k)e 



-(fc-t) 2 Az 2 (0) 



Using the dispersion relation ( 46 1 , the freely-propagating solution at later times can then be written as 

dkip{k)e tkz ~ tu{k)t . 



i>(z, t) 



i 



(48) 



(49) 



(50) 



Since y < 1, the relativistic dispersion relation ( 46 1 can be expanded in a Taylor series about k 



f[ 7 -l]+c|3(i;-t) + !i(j - /.•) 



2 7 3 m 



in <:~ 

n 



cf3k- 



1 h 

2 7 3 m 



(k-kf 



2 Az 2 (0) ' 



After a little algebra, we find the longitudinal variances to be 

A,^(0 = A^(0) = : 

and 

A, 2 (i)«A, 2 (0) + i^ 5W =A, 2 (Q) 
Differentiating, we find that the spatial spread is minimized at any subsequent time t > by the choice 

A^ 2 (t)=^. (54) 



Ap 2 z (0)t 2 
m 2 7 6 



(51) 

(52) 
(53) 



Over the interaction time At « " " , this spread Az(t) must remain small compared to the smallest length-scale 
associated with variations in longitudinal forces experienced by the particle, namely the optical wavelength Ao, so 
that the particle can behave like a point particle and can be taken to have a well-defined phase in the optical field. 



That is, classical behavior requires Az(At) <§; Ao, or 



Since X u ~ 7 2 Aq, this condition is usually written in the form 



4-7T 7 



< An 



(55) 



(56) 
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In the FEL literature, this condition (56 1 is often the single requirement mentioned for allowing a classical treatment 
of electron dynamics, but again rather more care is needed, because both position and momentum uncertainty must 
remain sufficiently small for a classical treatment to be accurate. Consistency demands that the quantum mechanical 
uncertainty in particle momentum Ap z (t) at least remain negligible compared to the classical longitudinal momentum 



spread of the beam, dp ~ -^j-p ~ mcSj. For a plane-polarized wiggler field, another small longitudinal momentum 



— 1 

scale exists, namely the extent of longitudinal momentum variation due to the "figure-eight" particle orbits in the 

a 2 

plane-polarized wiggler fields, which, from energy conservation, can be shown to be approximately 8p\ rs -f po in 
terms of momentum excursions in the lab frame. A little algebra reveals that uncertainties in position and momentum 



can be made simultaneously negligible provided (56) is satisfied, as well as the conditions 



^«47r,5 7 (57a) 
^«4tt^. (57b) 

We have so far neglected the effects of any longitudinal forces. The fast harmonic motion along z in a planar wiggler 
should only slow the spatial spreading, or at least not significantly increase it, as should any ponderomotive bunching 
in the electromagnetic fields. Coulomb repulsion from the space-charge fields will be de-focusing, and might lead to 
additional spreading of the wave-packets, but classically this force is typically negligible compared to the other forces 
in the present parameter regime, as we will see shortly. 



F. Radiation Reaction 



However, in both the longitudinal and transverse dynamics, we have so far ignored a potentially important force, 
that of radiation reaction, or direct recoil. In order to consistently treat the particles classically but the radiation 
emission quantum mechanically, the effects of radiation reaction, or particle recoil, must be negligible [73 . Intuitively, 
we expect that the total effect on any particle due to the spontaneous wiggler radiation fields, including self-fields, 
should be negligible simply because the radiation field strengths are much weaker than those of the external wiggler 



fields, i.e., a\ <C a u , where a± is the normalized vector potential for the spontaneous wiggler radiation. Using (30 1 



as an estimate for the total power radiated per particle, assuming each particle emits into a cone with synchrotron 
opening angle 89 ~ ~ (larger than the coherent opening angle by y/N u ), and assuming the particle positions are 
governed by shot noise, so the emission from different particles is randomly phased and therefore adds incoherently 
in intensity, we find that Oi -C a u provided 

a 2 N u [nal\ c ] ^ ~ ^a 2 N u na 2 ± X c ^ « 1. (58) 

While easily satisfied in the parameter regimes of interest here, this is essentially a far-field condition, while the 
radiation reaction force is manifestly a near-field effect, so more care is again needed. 

In order that we can ignore recoil effects in the longitudinal motion, the average energy of radiation per particle should 
be small compared to the average particle energy, i.e., 

aNuO^hajQ <C jmc 2 , (59) 

or equivalently 

«a'^«A . (60) 

Because the observed power radiated per particle (if it could be isolated) would be subject to large (Poissonian) 
fluctuations, we should really demand that the energy of any single radiated photon be small compared to the average 
particle energy: 

hu <C r ymc 2 , (61) 

or equivalently 

iA c « A . (62) 
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Actually, we should impose still stricter conditions. In the average rest-frame, the energy change due to the recoil 
from a particle scattering one virtual wiggler photon into one real radiation photon ought to be small compared to 
the typical RMS kinetic energy of a particle. Because Sj 1 » 2^2, back in the lab frame, this requirement entails 

1 7 2 « ^, (63) 

setting an ultimate bound on how cold the particle beam can be or become and retain classical behavior, which is 
nevertheless easily satisfied for any remotely accessible beam temperatures. Since the particle cannot actually absorb 
energy from the static wiggler magnetic field, perhaps we should even more strongly require that the energy of a single 
emission or absorption event by a particle (ignoring inconsistencies with momentum conservation) be small compared 
to the RMS particle kinetic energy, in the average rest frame. That is, we demand hw' <C mc 2 Sj' , or 

which apart from neglected factors of 2 amounts to the same thing as requiring that the lab-frame radiated photon 
energy is small compared to the lab-frame energy spread, i.e., 

hu>o <C mc 2 Sj, (65) 

which is reassuring since the Correspondence Principle Limit should be relativistically invariant. This condition 



is mathematically equivalent to the previous condition (57a I despite being deduced from very different physical 
considerations. 

In order that the particle continue to have a well-defined phase in the optical field after emission, it is also necessary 
that the total particle recoil over the interaction time, due to one photon emission, remains small compared to the 
optical wavelength. For a photon radiated in the forward direction, the momentum kick is roughly 

hk = Ap = mcA ( 7 /3) - mc (7A/? + 7 3 /3 2 A/3) - mcj 3 A(3. (66) 
We demand that < A , or 

^ « A . (67) 

This is precisely the same condition found above for the longitudinal quantum mechanical spreading of the wave- 
packet, without consideration of recoil effects or other longitudinal forces, to remain negligible. 

Transversely, off-axis photons are expected to be emitted with a polar angle 9 of at most about 9 ~ — , due to relativistic 
"head-lighting" effects. The resulting momentum kick will be small compared to the transverse momentum spread if 
ififco < Sp ± , or 

H«7 2 ^™ 2 , (68) 

where again p = mcyP is the average beam momentum. The transverse recoil should also be small compared to the 
transverse quiver, ^liko <C mca u , or 

hu)o <C a u jmc 2 ; (69) 

and finally, the total transverse recoil over the interaction time must be sufficiently small so that the resulting 
perturbation in the particle's transverse position does not appreciably affect the phase of any subsequently emitted 
radiation. The transverse momentum kick is approximately 

±hk ~ A Pl _ « mc 7 A/3j_. (70) 

The total perturbation in transverse position due to this recoil is 

Az~cAp ± ^ = ±^jr u . (71) 

For nearly on-axis particles, the resulting perturbation in the phase of the emitted radiation, as collected at the end of 

A 2 

the wiggler, is somewhere between A<fi ~ fcoAs and A<fi ~ fco 2 n\ > depending on the particle's longitudinal position 
in the wiggler, so demanding A(f> <^ 2n, we find 

fcuj <g; 7 2 A r „T !L mc 2 ~ N u mc 2 (72) 
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and 



huj <C 



1/2 



" lll<~ 



(73) 



We have seen that recoil effects on any one particle are negligible for its subsequent dynamics, but in order that recoil 
can be fully ignored, its effects on the radiation field must also be small. From the Compton scattering relation, we 
see that in the average rest frame of the beam, the shift in frequency Soj' due to particle recoil during the scattering 
of a virtual wiggler photon into a real radiation photon in a Weizsacker- Williams treatment will be small compared 
to the central radiation frequency u' provided that, in the lab frame: 

2^«1, (74) 

or equivalently, 

A c < 2 7 A . (75) 
Actually, this Compton shift should also be small relative to the coherent bandwidth, requiring 

1 Hu>g ^ hui ( 7( y\ 
7 mc 2 ^ N u ' \'°> 

equivalent to the by now very familiar requirement that 

^A c « A . (77) 

We must also demand that the recoil from emission of a photon will not change the energy of any one particle so much 
that any subsequent emission by that particle will likely lie outside the original coherent bandwidth at the average 
beam energy, but a little algebra shows that this condition is again ( |77[ ) ; we begin to understand why in the literature 
this is often regarded as the condition defining the classical/quantum boundary for wiggler physics. 

As pointed out by Benson and Madey [12] and Brau [13], these standard recoil arguments really only indicate 
that expectation values of particle observables can be calculated classically. We must also verify that variances or 
fluctuations about mean values are dominated by classical effects such as shot noise in the beam. We expect that 
the most stringent condition for higher-order moments will arise from considerations of longitudinal motion, so we 
concentrate on this behavior here. We have seen that the average recoil of a particle due to photon emission is 
sufficiently small for a classical description to be valid, but it should also be the case that additional quantum 
fluctuations due to the variance in the photon emission are negligible. 

Now, it is known that the photon counting statistics for Compton scattering from plane waves is Poissonian, so 
for sufficiently uniform wiggler fields and a sufficiently relativistic beam, such that in the average beam rest frame 
the wiggler field can be represented as virtual photons in a plane-wave mode according to the Weizsacker- Williams 
approximation, photon counting statistics for photons scattered by any one particle should be Poissonian or close to 
Poissonian. Of course, it is precisely the correct understanding of the field statistics, before and after amplification, 
as experienced not by hypothetical photon counters, but by actual individual particles in the beam which is at issue 
in our entire analysis, and we are not yet equipped to make a careful calculation. But to arrive at a conservative 
upper bound for the variance or fluctuations due to recoil, we can assume that the photon emission statistics for each 
particle in the wiggler are Poissonian, and then demand that the extra quantum mechanical uncertainty in energy 
due to fluctuations in photon emission is negligible compared to the momentum uncertainty already needed to satisfy 
the uncertainty principle, which in the absence of these recoil fluctuations has already been shown to be negligibly 
small. Even though most of the total energy emitted by a given particle will be radiated at frequencies more than one 
coherent bandwidth below the central photon energy huj , and/or outside the coherent solid angle 59 ~ > to be 

conservative we will here assume that all power is emitted close to the forward direction near the frequency ujq. The 
expected total number of photons radiated per particle (including those outside the coherent mode), in the pickup is 
then estimated as n rad ~ aN u a\. The additional uncertainty in particle energy due to these fluctuations in radiation, 
assuming Poissonian statistics, is then 

S% ad ~ = t V^aPu- (78) 

For a classical-behaving wave-packet, the uncertainty in longitudinal position at the end of the pickup interaction 



should be no more than a few times the minimum uncertainty allowed as given by ( 54 1 , which corresponds to an 
approximate lower bound on the Heisenberg uncertainty in beam energy: 



*W > y/gfc. (79) 
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Demanding that <57 rad <C <57 HU pi we find that 

Xo^X«l ; (80) 
which will be readily achieved in situations of interest. 



G. Summary of Arguments for Classicality of Particle Degrees-of-Freedom 



We have uncovered a plethora of conditions in support of our claim that the particle dynamics in the pickup wiggler 
can be treated classically. Perhaps we should review the most important: pi, <C (ir) 3 ^ an< ^ P b ^ X 3 " (^y~) 3 ' so that 
the beam particles are non-degenerate; ^ <§C 1, so that pair-creation and other exotic QED effects can be ignored; 
— < 1, so that certain spin effects can be neglected; and 



< min 

mc 2 



iW 'v n n °^ ^ N 7 J- 7 T 3 7 §P± ySp± ] /oi\ 

" h h ««; "« h 7 ) Qu ) iy «7 v /]\^' W u ' JV*a u ' N M a u ' aAT>J ' mc ' mcj> V 01 ; 

so that a variety of other potential spin, transverse, and longitudinal effects may be ignored 

Note that the condition N tf c <C Ao, or <C j^-, is often mentioned as the fundamental condition for classical 
behavior of particles in wigglers, because it simultaneously ensures at least two fundamental conditions for classicality: 
that the longitudinal spreading of a single-particle wave-packet over the length of the wiggler in the absence of emission 
remains small relative to the shortest force length-scale, which is the optical wavelength; and that if a photon is emitted, 
then the resulting recoil is sufficiently small so that the resulting shift in particle position over the interaction time in 
the wiggler is small compared to an optical wavelength. However, for short wigglers this condition, while important, 
is not typically the most stringent of the ones we have established. 

It should also be acknowledged that, strictly speaking, all the inequalities deduced above, individually or collectively, 
constitute neither sufficient nor necessary conditions for the observation of exclusively classical behavior on the part 
of the beam particles. The failure of any one condition does not necessarily indicate that quantum behavior will be 
observed, but only that it will be necessary to perform an actual quantum-mechanical calculation to determine whether 
the quantum corrections to classical behavior will be appreciable or not. That is, they are necessary conditions for 
avoiding a detailed quantum treatment. 

Conversely, certain of these conditions determine whether sufficiently classical-looking quantum states for the beam 
exist, but even when such states are available there is no guarantee that the particles will be prepared in or remain 
in such a state. However, it remains a poorly understood but widely observed tendency for systems, especially 
macroscopic systems consisting of either massive objects or of many interacting constituent parts, to actually behave 
classically whenever they can behave classically. The emergence of a classical world from quantum physics remains 
a fundamental and only very partially understood problem, although some recent progress has been made in this 
direction by Zurek |14[ 1151 116] and others concerning the analysis of decoherence and so-called environmentally- 
induced superselection rules. 

In the case of particle beams at least, all empirical evidence suggests this is true. We believe that the resolution lies in 
the realization that one does not typically track or make observations on individual particles but rather the particle 
beam as a whole, or at least some slice or segment of the beam, which always contains many particles and is always 
subject to uncertainty due to uncontrolled classical noise from the environment and/or our limited initial knowledge 
and subsequent measurement precision. Recall that the closest classical analog of a quantum state, even a single- 
particle state, is not a particle trajectory but rather a phase-space distribution, or statistical ensemble of trajectories. 
If there are sufficiently classically-looking quantum states for individual particles, consistent with what is known 
macroscopically about the beam, typically consisting of only a few parameters such as average dimensions, density, 
energy, and the longitudinal and transverse emittances, and if furthermore the classical emittances are sufficiently 
large, then classical statistical uncertainty can simply swamp quantum fluctuations, and the density matrix for the 
beam as a whole may be essentially indistinguishable from a proper statistical mixture of the most nearly-classical 
pure states, regardless of whether individual particles were ever actually prepared in or remain in these particular 
classical-looking states. 
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X. CLASSICAL SINGLE-PARTICLE DYNAMICS ARE ADEQUATE 



We confidently conclude that particles in the beam can be taken to behave classically throughout the pickup wiggler, 
and furthermore, assert that they will follow classical trajectories determined by the initial conditions and the external 
wiggler magnetic fields (and other external focusing fields, if present) and perhaps the mean self-fields, with negligi- 
ble perturbation from the spontaneously-radiated pickup radiation itself, cither through recoil (radiation-reaction), 
coherent forcing, or incoherent scattering. 

Under these conditions, we will see that it is possible to consistently treat the physics in what we will call a "hcmi- 
classical" formalism, where the particles are treated classically but the radiation is described quantum mechanically, 
in contrast to the "semi-classical" regime, a description usually applied either to a system in some sense poised 
between quantum and classical behavior and approaching a correspondence principle limit or otherwise amenable to 
a long wavelength asymptotic analysis, or else to interactions between matter and fields in which the matter must be 
described quantum-mechanically but the radiation can be described classically |74j. 

While not strictly necessary for the validity of the hemi-classical description, it will nevertheless be quite convenient to 
also argue away even the Vlasov (mean) fields, so that in fact the beam can be simply described by the single-particle 
physics of classical charges moving in fully external fields. 



A. Radiation Scattering Effects Are Small 



We have already seen above that radiation reaction effects may be neglected, so the effects on a given particle of its 
own wiggler radiation are unimportant, But this radiation actually slips ahead of the particle, and might interact 
with other beam particles downstream. The simplest way to check whether these effects might be important is to 
compare the density of actual radiated photons inside the wiggler to the effective density of virtual wiggler photons. 
In the average rest frame, the latter photon density is given approximately by 

<~™ _1 xS' (82) 



Using ( 30 1 , and assuming the radiation from each particle is predominately radiated within the synchrotron opening 
angle ^ in the lab frame, and remembering that a given particle will feel the fields from about N s other particles while 
traveling through the wiggler, the density of radiated photons experienced by a particle in the average rest frame is 
at most about 

<, d ~4a2^i, (83) 
and demanding n' ad -C n' u , after a little algebra we deduce the requirement 

a 2 7 2 iV s « f Ar 2 §, (84) 
which is expected to be satisfied with many orders-of-magnitude to spare. 

With this density of radiation photons, the mean free path lc for Thomson/ Compton scattering from any one beam 
particle may be roughly estimated in the lab frame by setting 

4a 2 A2Ir4 d « 1, (85) 

so we will have l c <C N U X U provided 

a 2 | 7 2 ^a 2 «l, (86) 

which is also readily satisfied, so scattering off the pickup radiation by downstream particles is completely negligible. 
Besides, even if subsequent Thomson or Compton scattering from the radiated field does occur, the momentum 
transfers will be of comparable magnitude to the direct radiation recoil terms during the original emission, already 
known to be negligible. 



24 



B. Mean Space-Charge Forces are Mostly Small 



Assuming a long, nearly azimuthally- symmetric beam with uniform particle density rib, the average transverse 
Coulomb-Lorentz force due to self-fields on a beam particle at a distance r from beam axis is 

F r (r) = ^pr, (87) 

so that the average electric and magnetic transverse forces due to mean self-fields cancel to O for a symmetric 

beam. The residual repulsive force will be negligible compared to the Lorentz force from the wiggler provided 
F r (a±)^^ < a u mc, or 

ol — ^—^N u na±X u X c < 1, (88) 

which is readily satisfied in the stretched beam. To be completely negligible, this repulsive self-force should also result 
in transverse beam spreading which remains small compared to the optical wavelength over the interaction time, i.e., 

I^(^)[^] 2 «Ao,or 

a— N 2 ncr±\ u \ c < 1, (89) 

7 

which is also satisfied provided the beam is sufficiently stretched. For a long uniform beam, the average longitudinal 
force on an randomly chosen particle vanishes by symmetry, but the RMS force does not, and particles away from the 
midpoint of the beam will experience Coulomb repulsion. The velocity is highly relativistic, so particles are inertially 
stiff, but on the other hand, the relevant length-scale for perturbations is only the optical wavelength Ao over the full 
interaction time At in the wiggler. At about one standard deviation into the tail of a Gaussian beam, the Coulomb 

repulsion will scale like F z <~ nq 2 n^ L . The resulting spread over the interaction time should be small compared to 
the optical wavelength, i.e., 



2 [7 + /3 V] to 

which becomes 



F, 



< A , (90) 



aNl-na 2 perp \ u — < 1, (91) 
7 &z 



which is easily met in our parameter regime. 



C. Fluctuations and Collective Oscillations are Mostly Small 



In addition to establishing that the average space-charge effects are negligibly small, to we must additionally show 
that the fluctuations and collective oscillations about this average behavior, arising from self-fields, are also negligible. 
Collective plasma oscillations will be unimportant if cither the relativistic plasma frequency u' p in the average rest 
frame is sufficiently low so that oscillations simply do not have time to develop during the interaction time, or if 
any Langmuir waves that might arise are heavily suppressed via Landau damping mechanisms, which will occur if 
the typical wavelength A^ for plasma waves is small compared to the Debye screening length A^. In the average rest 
frame, the relativistic plasma frequency is given in Gaussian units by 
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(92) 



where from relativistic length contraction n' = n/7 and At' w N ^ u , so demanding uj' p At' <C 2tt, we find 



2a—N 2 n\ 2 u X c 



1/2 



< 2tt. 



(93) 
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which is expected to be satisfied, at least for sufficiently massive particles like muons or protons (but possibly not for 
electron beams). 



In the average rest frame, the Debye length is A^, = 



v£, — y 4 ^°2^/ ; where X" is the effective longitudinal rest-frame 
temperature. From rclativistic velocity addition, we have seen that 



k B T' « mc 2 (/3' 2 ) w mc 



/3V' 



(94) 



assuming 7> 1 and 1 < 1. Therefore the rest-frame Debye length can be written in terms of lab- frame quantities as 



A' 



a\ c n 



7 ' 



(95) 



The rest-frame wavelength A^ for plasma oscillations is expected to be comparable to that of the wiggler period in 
this frame, A^ ~ A^ = \ u /l, so that plasma oscillations will be damped provided A^ <C A^,, or 



2aA„A c n 



< 1. 



(96) 



Because the wiggler period in the envisioned OSC scheme is so long compared to most other applications, this may be 
only marginally satisfied or may actually be violated, but because the beam is diffuse and the total number of wiggler 
periods is so small and the resulting interaction time so short, waves that are not suppressed by Landau damping will 
not really have much time to arise, as demonstrated in (93 1 above. 



While we can and will therefore neglect collective oscillations here, although these conditions could fail for certain 
beam parameter regimes, and then the effects of collective space-charge oscillations would in principle need to be 
included in the particle dynamics for a fully consistent treatment. THis is left for future research. 

Collisions, or binary (but possibly shielded) Coulomb interactions between discrete particles not accounted for in the 
Vlasov (mean) fields, could also cause deviations from the ideal wiggler orbits. Including Debye screening effects, 
the effective Coulomb scattering frequency v' c in the average rest frame (including the cumulative effects of many 
small-angle scattering events) is approximately 



In (A') 



m 2 c 3 <5/3' 3 

where 8/3' is the RMS velocity in the average rest frame, and where 



(97) 



A' = A'-V 3 
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(98) 



is the rest-frame plasma parameter, and satisfies A' ^> 1, while typically In (A') < O(10) or so. Comparing to the 
plasma frequency, we see that 



In (A') 
2ttA' 



(99) 



so that the collision frequency is expected to be much smaller than the plasma frequency, and is expected to remain 
unimportant. Specifically, Coulomb scattering will be negligible if v' c At' <C 1, or 



In (A') 

— - — - -c 1 



(100) 



which is readily achieved for beams of interest. 



Finally, while we have seen that we can neglect the effects of the space-charge forces on the average density n, or 
equivalently on the one-particle distribution function, we seek to establish that space-charge effects will not lead to 
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significant correlations in the two-particle distribution function, so that the relative positions of beam particles can 
still be described by classical shot noise. Such shot noise is characterized by Poissonian statistics within any given 
spatial region, or equivalently by particle positions chosen randomly and independently in proportion to the average 
density n, and so shot noise in any one Lorentz frame will appear as shot noise in any other frame, just with an 
appropriately Lorentz-transformed density. In the average rest frame, Coulomb repulsion is expected to lead to a 
certain amount of anti-correlations in particle positions, which may develop over the full lifetime of the beam, not 
just the interaction time in the wiggler. Roughly on the time-scale v~ l corresponding to the Coulomb scattering and 
dielectric screening times in a plasma, we expect the beam to have approximately approach a state of thermodynamic 
equilibrium in its average rest frame, at least in the absence of cooling and when neglecting the slow expansion of 
the beam, and hence in order to estimate the magnitude of these particle correlations we can make use of well-known 
results from equilibrium statistical mechanics, without having to solve a much more difficult time-dependent problem 
in kinetic theory. 

In the average rest frame, consider a sub-volume SV small compared to the beam as a whole but large enough to 
contain many particles on average. (Strictly speaking, we actually need not demand that SN' 3> 1, but only that 
SN' <C Nb.) Treating this region as a thermodynamic system in thermal and diffusive equilibrium with the rest 
of the beam temporarily regarded as a heat and particle reservoir, it is easy to see that in the absence of particle 
interactions, the particle number SN' in this volume will be subject to Poissonian fluctuations. In the non-degenerate 
(low-n and/or high-T) limit of indistinguishable free particles, the partition function for the region of interest can be 
taken to be 



Z'(T',SN',SV) 



where 



nQ 



[ n Q 5V '] 
SN'\ 



SN' 



(101) 



(102) 



is the quantum concentration, corresponding to one particle in a volume whose linear dimensions scale like the thermal 
dc Broglie wavelength X' dB , and where for simplicity we assume the transverse and longitudinal temperatures are equal. 
Using the usual Stirling approximation for the factorial term, the Hclmholtz free energy is given by 



F'(T',SN',5V) = -k B T' In Z' = SN'k^T' 
so the chemical potential is 



H'{T',5N',5V) = J^F'(T\SN\SV) = fc B T'ln I 5N ^- 



The grand partition function can then be written as 

oo 

Z'(T', l i',SV') = 



,[n> Q SV>] SN ' 



SN'=0 



SN'\ 



e X'n' Q 6V' 



(103) 



(104) 



(105) 



where A' = e k B T ' is the absolute activity. The average particle number within SV is then 

SN'(T',h',SV') = (SN'(T',»',SV')) = X'-^ In Z'(T',X',SV') = X'n' Q SV' 

and the probability distribution for particle number can be written as 

pf(mf] 1 [Xn' Q SV'] SN ' (SN'r'e-^') 
1 ' Z> SN'\ SN'\ 

which is a Poisson distribution, as expected, so that particle fluctuations satisfy 

'(SN' - (SN')f) = (SN') . 



(106) 



(107) 



(108) 
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When the effects of Coulomb interactions are included, the general problem becomes somewhat intractable, but a 
perturbation-type expansion for the partition function or free energy can be found in the typical ideal plasma regime, 
corresponding to large Dcbye shielding, A = \%n' ^> 1, or equivalcntly weak coupling: 



k n T' 



<1, 



(109) 



so that that the average Coulomb potential energy between neighboring particles is small compared to their mean 
thermal kinetic energy. For a single charged species, the Helmholtz free energy becomes 



F'(T', 6N', 6V) = 6N'k B T' 



In 



SN'/SV 



SN' 
SV 7 



(110) 



Because of the long-range nature of Coulomb force, successive terms in the expansion include various fractional powers 
and even a logarithm of the local density , so this is not a simple virial expansion in the density, and because of the 
appearance of q 3 , neither is it an analytic perturbation expansion in the natural coupling parameter q 2 . But assuming 



the perturbation parameter (109) is very small, we need include only the leading correction, the so-called Debye term, 
and need not worry here about non-analyticity or convergence of higher-order terms. Typically, this free energy for 
a Coulomb gas is derived using textbook Debye-shielding arguments (see for example [T7]) leading to an effective 
screened potential energy of Yukawa form, but such derivations assume net neutrality of the plasma. However, the 



expansion (110 1 remains valid even for non-neutral plasmas, and can be derived from quantum or classical statistical 
mechanics by re-summation techniques, or in a kinetic theory formalism by truncating the BBGKY hierarchy beyond 
the two-particle distribution function, a valid approximation when the inter-particle correlations in fact do remain 
small, which can be self-consistently checked at the end of the calculation. 

At least in the plasma or Debye-shielded regime, the long-range nature of the Coulomb force is both blessing and 
curse; it is typically impossible to treat exactly, but the statistical correlations which can develop between a given 
particle and any one other randomly chosen particle in the plasma will on average be small because of the screening 
effects of all the other intervening particles, and the long-range correlations which do persist result from the self- 
averaged effects of many particles and can be incorporated through a self-consistent mean-field (in our case the beam 
space-charge field), leading to the well-known collective effects, as distinct from true many-body correlations that can 
only b described via a joint probability density function. 

While it is somewhat troublesome to find a convenient expansion for the full probability distribution for the fluctuation 
SN' , it is straightforward to find the so-called Fano factor, or ratio of variance to average, sufficient for our purposes. 



Using the Helmholtz free energy (110 1, the chemical potential for the Coulomb gas becomes 



v(T',6N',6V) = J^F'iT'^N'^V) = fc B T'ln 





' 1 SN'~ 




k s T' SV 



1 



(111) 



Shifting from the canonical to the grand canonical ensemble, the chemical potential /j,(T',SN ,6V) evaluated at 
the average particle 6N' is formally invertible, and can be taken to implicitly define the average particle number 
6N' (T' , pi ,6V) = {8N' (T' , p' ,8V')) as a function of the chemical potential p'. The variance in particle number can 
then be expressed as 

/ (SN' - (SN')) 2 ) = k B T'^- (SN'(T',p',SV)) = -= 1 -= , (112) 

V I dp' K K ;/ ~g2=ii'(T', 5N', 5V) y ' 

which after a little algebra can be written as 

l2 \ SW 



(SN'-iSN'))' 



(113) 



n ■ 



n^\ 3 ' 2 ' 



28 



so the actual Fano factor may be written simply as 

(SN'-{SN'))' 



{SN' 



1 - 



(114) 



16ir A' 



Provided the weak-coupling condition (109) holds, we see that any deviations from Poissonian fluctuations are indeed 
small, at least at the level of first and second-order moments. 



In summary, to a very good approximation, we may therefore assume that that particles in the beam behave classically 
in the pickup wiggler, even as the particles radiate, and that their relative positions are well approximated by classical 
shot noise, and that their trajectories are determined almost exclusively by the external electromagnetic fields and 
their individual initial conditions at the entrance to the wiggler, and are in most cases largely unaffected by collisions 
or space-charge forces or by the radiation field itself, either through coherent or incoherent scattering or through 
radiation reaction|75j. 

With this classical model of prescribed particle trajectories, the subsequent analysis can be tremendously simplified. 
To make matters even simpler, off-axis effects and all betatron or other transverse motion of particles are also 
ignored, accept for the idealized ID quiver motion in the planar wiggler field, as determined by canonical momentum 
conservation. Higher harmonics of particle motion in the planar wiggler are ignored, so the longitudinal motion is 
taken to be uniform. 



XI. QUANTUM MECHANICAL DESCRIPTION OF WIGGLER RADIATION 



An exact analytic description of the quantum mechanics of emission would be prohibitively difficult, requiring a 
complete treatment of the dynamics of the coupled particle-field system, and in general leading to highly entangled 
states for the matter and radiation. But in the regime established above, where recoil effects are negligible and 
particles follow classical orbits that can be determined independently of the actual radiation fields, the problem is 
simplified tremendously. 

However, before we can deduce the quantum state of the radiation, we must digress to summarize a quantum optics 
formalism that can capture the collimation, collection, transport and amplification of the wiggle radiation. 



A. Quantum Optics for Dielectric-Guided Paraxial Beams 



After emission in the pickup wiggler, travels essentially paraxially down a mostly open beam line, but which does 
contain various passive dielectric lenses used collect, and transport the radiation, as well as one or more apertures 
to select transverse mode characteristics, in addition to the active gain medium of the amplifier or amplifiers. If the 
field is quantized in the vacuum, then the effects of dielectric elements must be included as interaction terms in the 
Hamiltonian coupling a large number of vacuum modes, complicating the dynamics. 

Instead, we can quantize directly the "macroscopic fields" in the presence of an idealized linear medium described 
by susceptibility tensors. Numerous approaches to this problem may be found in the literature, for example those in 
[TSlOIJltniCIIlllIllilJIM even earlier antecedents. Most are restricted to special cases 

where the medium is homogeneous and isotropic, or the dielectric tensor is frequency-independent, or only certain 
kinds of frequency response are allowed. The more general problem, involving anisotropy, slow spatio-temporal 
variations in the medium response, and arbitrary frequency dispersion consistent with the Kramers-Kronig relations, 
remains an area of current research |76j. and certain fundamental debate persists [771. 

Here we consider the unmagnetized case where /z = 1 while the dielectric tensor e(x) is real (i.e., lossless), positive, 
isotropic, dispersionless to a good approximation over the wiggler and amplifier bandwidths, but possibly spatially 
inhomogeneous, i.e., piecewise smooth in its dependence on position x. This model allows for transport of the radiation 
by dielectric mirrors or by ideal lenses without chromatic aberration, In practice, some dispersion will be present, so 
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here e(x) is chosen to best represent the average effects of the actual lenses or other optical media along the beam 
path. Perfectly-conducting boundaries [78] can also be included to model apertures, beam pipes, or metallic mirrors 
[75] . The resulting eigenmodes in general possess more complicated spatial structure than simple plane waves, but 
will retain harmonic time dependence. The exact form of these modes is not actually needed for our purposes here, 
but in the vacuum regions can be well approximated by the usual paraxial Gauss-Hermite basis set. 

Here we just summarize the main tools and results. In the presence of the hypothesized medium, the macroscopic 
fields can still be written (in Gaussian units) asB = H = 'VxA and E = e~ 1 D — — V0 — ^-j^A. We choose to 
work in the generalized Coulomb, or e-transverse, gauge, where 

V-[eA]=0. (115) 

For any piecewise-smooth dielectric function satisfying < e(x) < oo, it can be rigorously shown using generalizations 
of the Helmholtz theorem and existence and uniqueness results for the Laplace equation, that this is a well-defined 
gauge choice leading to a unique decomposition of the electric field into an e-transverse component containing all (but 
not necessarily only) radiation and an irrotational (curl-free) non-radiative component. 

From the macroscopic Maxwell's Equations, we deduce that the scalar and vector potentials potentials must satisfy 



V • [eV$ = -Awp, (116a) 

4-7T £ 
-J+-; 

C C 



-V x (V x A) - j,^A = - -J+ * V0 (116b) 



where p and J are the "free" charge and current flux densities, respectively, for any charges not implicitly accounted 
for through e(x), namely those in the particle beam. 

Given the assumed good behavior of the dielectric function, it can be verified that the generalized Poisson equation 



(116a) has existence and unique properties essentially identical to the standard Poisson boundary value problem. 
The scalar potential <p is therefore determined by the instantaneous positions of all the additional charges, as in the 
usual Coulomb gauge, and satisfies an unretarded elliptic equation, hence does not embody independent dynamical 
degrees-of-freedom for the EM fields. 

Separation of variables is used to express the vector potential A in terms of paraxial eigenmodes Uj(x;ui) of the 
generalized Helmholtz equation: 

e(a;) _1 V x V x Uj(x;w) = ^Uj(x;w); (117) 

Here the set of integers labeled by I is taken to index the transverse polarization and modal structure, assumed to 
be discrete, as in a paraxial beam or a waveguide, so I may have multiple components which arc implicitly hidden in 
the notation for brevity. The frequency u> labels |80j what is in general a continuum of longitudinal modes. 



The generalized Helmholtz operator appearing in (117) is not Hermitian with respect to the usual £2 inner product 



on M , but it is Hermitian with respect to the e- weighted inner product: 

(,,,),=/#«*)»(«)•,(«), (118, 

and with suitable boundary conditions can be naturally extended to a self-adjoint operator, and the eigenmodes may 
then be chosen to be orthogonal with respect to this inner product: 

(u e (x; u) , u e ,(x; J)) t = St e 8(uJ - u/), (119) 

where Sigi and S(u) are the Kronecker delta symbol and Dirac delta function, respectively. 

These modes will be complete in the sense of spanning the space of e-transverse vector fields in the quantization 
region, so the vector potential A — A e ± for the radiation fields can be decomposed as 

00 

A(x,t) = ^2 I 'dujc^J^ [ae(u;)u e (x;uj)e- %UJt + a e (uj^u* l (x;uj)e +tuJt ] , (120) 
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where the macroscopic field operators ae(ui) satisfy canonical commutation relations 

[ae{w),at'{u'y] = 8it>5{u — (J) 
[at(uj),ae(u>') ] = 0. 



(121a) 
(121b) 



In the presence of the dielectric medium, the "quasi-free" Hamiltonian governing propagation in the absence of free 
charges becomes 



where constants and other terms which do not depend on the field operators have been dropped. As in the vacuum 
case, we see that the free-field Hamiltonian remains equivalent to that of a collection of uncoupled harmonic oscillators. 
The source/field interaction Hamiltonian remains 



Note, however, that the macroscopic operators aj(u>) are "dressed," in general containing inside regions with nontrivial 
permittivity, contributions from both the usual creation and annihilation operators associated with the microscopic 
fields in free space. 

Recall that here the background dielectric function must be real and independent of time and/or frequency. Any 
effects of dispersion or attenuation must be incorporated explicitly through additional interaction terms. 

The gain in the active amplifying medium must also be treated quantum mechanically, but a specific atomic model 
will not be needed; very general features of quantum mechanics will be sufficient to characterize its best-case noise 
properties without explicitly evolve the radiation fields through the system. 

One could argue that by neglecting dispersion we are throwing away a source of noise for the radiation. Dispersion at 
some frequencies implies absorption at other frequencies according to the Kramers-Kronig relations, while fluctuation- 
dissipation theorems suggest that this absorption must also be associated with noise. Since quantum noise in the 
radiation field is of central concern, one might reasonably fear that ignoring dissipation and the attendant fluctuations 
in the lenses or other dielectric elements is a terrible idea. However, there are no fundamental lower bounds on the 
losses and fluctuations introduced by passive dielectric elements in any given bandwidth, so this perhaps is a useful 
idealization, because the frequencies at which absorption occurs and noise is injected are hopefully well removed by 
design from the bandwidth of interest. If not, because we will assume the radiation propagation and amplification are 
entirely linear, we can actually lump any dispersion, attenuation, gain, and noise into a single generalized amplification 
process, and need not ascribe them to individual elements. 

We will also assume all lenses are suitably coated to suppress reflection losses, which would otherwise lead to additional 
"partition noise" as in a idealized beam-splitter, because any unitary description of such devices would couple into 
the transmitted mode part of the vacuum fluctuations from the additional mode. Again, we are seeking a best-case 
analysis, and any extra additive noise could be easily incorporated at the end of the calculation. 



At last we almost equipped to determine the quantum state of the radiation field due to emission in the pickup wiggler, 
and after it passes through any passive, linear, collection optics. 

We have seen that, with high accuracy, the dynamics can be treated in our so-called "hemi-classical" approximation, 
where the particles and static wiggler field are treated classically, without any radiation reaction forces, while the 
light is treated quantum mechanically, a regime complementary to the usual semi-classical approximation, in which 
the matter and its Coulombic interactions are treated quantum mechanically but the radiation fields are assumed 
classical. 



oo 




(122) 




(123) 



B. Hemi-Classical Wiggler Radiation 



First we briefly review some important properties of the coherent states, first discussed by Schrodinger but introduced 
and extensively analyzed by Glauber [3SJ [33J 132 ES] in the context of the quantum theory of optical coherence. 
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1. Coherent States 



As eigenstates of the modal annihilation operators for the fields, such coherent states possess many useful mathe- 
matical and physical properties, a few of which will be briefly reviewed here. (See, for example, |36] for a thorough 
modern discussion.) Given a complete orthonormalized set of field modes, the (multi-mode) continuum coherent state 
|{o^(w)}) is by definition an eigenstate of each modal annihilation operator ag(u)) with complex eigenvalue ai(uj), i.e., 

a t (u)) \{a t (w)}) = a t {u) \{at(u)}) . (124) 

The choice agiuS) = corresponds to the vacuum for that family of modes. The prescription ae(u>) = for all possible 
I and lu corresponds to the overall vacuum state |0) for the dressed photon field. In the case of a fully discrete set 
of modes, where only certain discrete set of natural frequencies Lvi k are allowed, this multi-mode Coherent state is 
manifestly separable, and can be written as tensor product (^) \ot(. k ). 

Ik 

By convention, phases are chosen so that the coherent state can be written as 

\at(u)) = D[at(w)]\0), (125) 

where 

D [ai{u))\ = e' (126) 

is the unitary modal phase-space displacement operator, which satisfies D [— ai(oj)] 1 = D [— ai(ui)\ = D \ai(w)y and 
D [ae(uj)]^ ae(uj) D [ae(uj)] = a^(w) + ai{uS), We see that Glauber coherent states just consist of displaced vacuum 
states, so they possess the same quantum mechanical uncertainties as the vacuum but with a non-zero bias to mode 
amplitudes. The states are well-defined and normalizable provided that ^2 e fduj \ai(oj)\ 2 < oo. 

Using this definition of the coherent state, it is straightforward to show for a discrete mode the coherent state can be 
written as 

\a th ) = \ai(co ik )) = V 1 ik} * \n tk ), (127) 



in which 



I"«> = ;t=t[«LP|0) (128) 



is the usual number eigenstate for this mode. From (1271, it is evident that coherent state will have Poissonian 
photon-counting statistics under ideal measurement conditions. 

Glauber coherent states are also minimum uncertainty states in two important senses. For simplicity , let us consider 
the case of discrete modes. For any choice of overall phase 4> € R, the Hermitian quadrature operators, or sine-like 
and cosine-like components of the field amplitude operator a.j(uji) are defined as 



X lk — X, h — \ 



which is said to be in-phase, and 



b a lk - e*al 



(129) 



(130) 



which is said to be in quadrature. The Hermitian quadrature operators can be shown to satisfy 

[X tk ,X t , k ,] = 0, (131a) 

[Y ek ,Y e , k ,] = 0, (131b) 

[X ek ,Y e , k ,] = i6 u ,6 kk ,. (131c) 
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Defining AX^ = Xtk — {Xik) and AY^ = Y^ — (Y^), it follows form the generalized Hcisenberg- Robertson uncertainty 
principle that 

(AXl) (AY e 2 k ) > I \[X ek ,Y ek }\ 2 = i (132) 

It is easily shown that any coherent state symmetrically saturates this uncertainty- principle limit, in the sense that 
the variances satisfy ((AXa) 2 ) = ((AYtt) 2 ) = j. Also, while it is impossible to define a true Hermitian modal phase 
observable $>ik conjugate to the modal number operator = a\ k agk, under reasonable definitions of a quasi-phase 
operator, such as the Barnett-Pegg phase [36J , the coherent states also saturate or nearly saturate the "number-phase" 
uncertainty principle: ((AN tt ) 2 ) ((A$ tt ) 2 ) > |. 

Coherent states also have important overlap and over-completeness properties, but these will be introduced as needed, 
as well as a beautiful group theoretic structure which we will not need here. 



2. Optical Correspondence Principle 



Central to our arguments is the so-called Optical Correspondence Principle, a theorem first established by Glauber (31] 
in early work on the theory of coherent states, and then generalized independently by Glauber |37j and Sudarshan and 
Mehta[38 39 . As it turns out, essentially the identical theorem holds within our formalism for "macroscopic" radiation 
propagation in a linear medium described by a real, positive e(x) and possibly bounded by perfect conductors. 

If the initial state of the radiation field is a multi-mode Glauber coherent state (possibly, but not necessarily, the 
vacuum state), and the free sources all follow prescribed classical trajectories, then the final state of the radiation 
field is also a multi-mode Glauber coherent state, whose expectation value is given by the corresponding solution to 
Maxwell's equations with the same sources and propagating through the same medium, with classical initial conditions 
given by the expectation value of the initial coherent state. 

If the sources of interest for the radiation, namely the charges in the particle beam in the present case, can be taken 



as providing a prescribed classical current, then the interaction Hamiltonian (1231 can be written as 



oo 

JCt(*) = -J2 [ dLU [Mu;t)*a e (Lu) + J t (w;t)a t (w)t] , (133) 
* o 

where 

Jt{w;t) = -y/^ J d 3 x u e (x;uj)*-J(x,t). (134) 

As long as the current density J(x, t) is considered a prescribed current density, rather than the result of self-consistent 
trajectories of charged sources moving in total fields, it need not be expressed explicitly in terms of canonical particle 
coordinates, and the above form for the interaction Hamiltonian thus holds even if the actual sources are arbitrarily 
relativistic. 

We see that each mode in the medium still evolves independently if subjected to prescribed sources, so for simplicity 
we will drop all subscripts and indices cluttering the notation and look at a single mode, which evolves according to 

JT(t) = huj a} a - [3(t)*a + d{t)a}} (135) 

for a given c- number function 3(t). Note that this Hamiltonian looks exactly like the corresponding Hamiltonian in 
absence of the dielectric medium and with the same prescribed currents, but here the definition and meaning of each 
dressed modal annihilation operator a is generally somewhat different than in a vacuum; by design, the effects of the 
medium are incorporated (hidden) in the spatial structure of the dressed modes, determining what kind of excitation 
(i.e., dressed photon) the operator a annihilates. If the initial state for this mode is described a Glauber coherent 
state up to some overall phase, i.e., if IV'(O)) = e l< ^°) |cc(0)) for some real number <^>(0) and complex a(0), then we 
claim that in the Schrodinger picture, the state at later times is still given by some 

|V(t)> = e*® \a(t)) , (136) 
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where \a(t)) is a coherent state with standard phase. Because the coherent states are over-complete, the state ( |136 1 



will be a solution to the time-dependent Schrodinger equation corresponding to the Hamiltonian (135), provided 



{(3\ihf t \rP(t)) = (f3\J?(t)\ij(t)) (137) 

holds for any multiple of any arbitrary time-independent Glauber state \0), i.e., for any state such that (f3\ = (3* (/3\ 
and ^(3 = 0. Using the eigenvector conditions for the coherent states \a(t)) and the inner product in the form 

((3\a)=e-^ 2+a2 ~^' a ), (138) 



and the over-completeness relation 



d 2 f3 



= I, (139) 



where d 2 /3 = dRc((3)dIm((3) and X is the identity operator on the relevant modal Fock space, we find that indeed 
\ip(t)) is a solution for all subsequent time, provided only that a(t) satisfies 



a{t)=-iua(t) + ti(t), (140) 



and 4>{t) satisfies 

= +\\ W)*a(t) - 3{t)a(ty] . (141) 

Note that the real quantity </>(£) is the overall phase of the quantum state vector for this mode, which does not appear 
in the corresponding density matrix, and should not be confused with the actual optical phase of the mode itself, 
whose expectation value may be taken as arg[a(i)], and is not directly related to 4>{t). Since the modal equation of 



motion ( 140 1 is identical to that associated with the classical evolution of H{t) with the same sources, the expectation 
value (a(t)) — a(t) simply tracks the classical field dynamics driven by the given sources, starting from a classical field 
given by the initial expectation value (a(0)) = a(0) of the quantum field a. Since this is true for each of a complete 
set of radiation modes, the result is proved. 



The essential idea is that the Heisenberg equations of motions for the electromagnetic field operators are exactly 
analogous to the classical Maxwell equations, or equivalently to evolution equations for harmonic oscillators (one 
for each EM mode), and so if the sources are the same prescribed c- number functions in both cases, we expect the 
solutions to be as nearly identical as is allowed by quantum mechanics. The average of the quantum mechanical solution 
coincides with the classical solution, while the uncertainties, quantified in terms of the variances and covariances of 
field quantities, remain just the minimum allowable by the Heisenberg uncertainty principle (because the amplitude 
and phase of each mode are conjugate), and are conventionally ascribed to "vacuum fluctuations." 

This is a crucial result to our main argument, and merits some elaboration. Coherent states can be thought of as 
phase-space displacement of the vacuum, i.e., a translation of the vacuum fluctuations by some average amplitude 
and phase angle, so that the resulting state retains the symmetric minimum-uncertainty or "zero-point" fluctuations, 
only centered about the new average amplitude and phase. For classical, prescribed sources, an initial coherent state 
remains a minimum uncertainty state, with the centroid just following the classical trajectory 

This correspondence between the classical solution and the average of the quantum mechanical state holds however 
dim the radiation; that is, regardless of how weak the sources are or how small the resulting average photon density 
is. The relative uncertainty increases, or in other words the SNR decreases, but the average behavior is still exactly 
classical. 



Note that Glauber's Correspondence Principle holds for arbitrarily relativistic sources, and the "prescribed" classical 
particle trajectories can include the effects of charges moving in non electromagnetic classical force fields, external 
classical EM fields such as the wiggler magnetic field, mean space-charge fields, or even the instantaneous (non- 
radiative) Coulomb fields of other charges, as represented in the scalar potential in the generalized-transverse gauge, 
just no feed-back from the radiation field itself. In general, any appreciable recoil during the radiation process, or 
subsequent scattering off of other particle's real (as opposed to virtual) radiation, can destroy Glauber coherence of 
the radiation field. In particular, any gain in the pickup wiggler due to the FEL instability must be negligible, so that 
all radiation is dominated by spontaneous rather than stimulated wiggler emission. 
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3. Optical Equivalence Theorem 



While the trajectories of the particles are describable by classical mechanics and assumed definite, they will not be 
known exactly but only probabilistically. If sources are classical but stochastic in this classical sense, i.e., definite in 
principle but unknown, then the radiation field will be in a statistical mixture of coherent states, each with positive 
statistical weight, i.e., a so-called proper mixture of coherent states, with the weights interpretable as a standard 
probability distribution reflecting ignorance. 

It turns out that any density matrix can be written as a weighted sum of projections into coherent states [34l l35| l40 | 
HU U3J H2] , if we are sufficiently generous in allowing the weights to become negative or include singularities or 
generalized functions. Specifically, for the case of discrete modes, we have the diagonal coherent-state representation 

Prad = j ' d 2 a ei (uJi)d 2 ai> 2 (u2) ■ ■ ■ P{a ll (w x ), a l2 (oj 2 ), ■ ■ .) |a^(wi); • • • ){att 1 (u) 1 ); • • • | , (142) 

where Pya^ (wi), ae_ 2 (u>2), . . . ) is known as the Glauber-Sudarshan P-function, or normally-ordered quasi-distribution 
function. 

Radiation from classical prescribed sources is described by states where the P function is non-negative and normaliz- 
able, so may be interpreted as a classical probability measure over the corresponding coherent states. In addition, the 
quantum measurement statistics of any state with such a nonnegative P function can be reproduced in a semi-classical 
model (i.e., quantum detectors, classical field) by using the analogous classical field. Conversely, all known manifesta- 
tions of truly quantum mechanical behavior in radiation, such as squeezing in number or quadrature, anti-bunching, 
violation of Bell's inequalities, etc., that cannot be reproduced in a classical or semi-classical model, involve states 
with negative or even infinitely negative P functions, leading to Sudarshan's Equivalence Principle\A2\ 143] . which 
asserts that non-negativity and normalizability of the P function are jointly necessary and sufficient [HI] for classical 
behavior of the radiation field. 

The closest analogs to classical radiation allowed by quantum mechanics correspond to coherent states or their proper 
statistical mixtures, i.e., to positive, normalizable P functions; and conversely, any state for which P is normalizablc 
and everywhere positive, so that it can be interpreted as a classical probability distribution, is in its field statistics 
and properties virtually indistinguishable from the corresponding stochastic classical field, regardless of how small 
the average photon number. Note that textbook requirement of large degeneracy (in the sense of large mean modal 
occupation, i.e., (Nj(bJi)) 3> 1 ) of the relevant participating modes is, strictly speaking, neither necessary nor sufficient 
for classical behavior of radiation. However, in practice increasing the degeneracy does tend to make observation of 
remaining quantum signatures (i.e., statistics which could not be reproduced classically) more difficult. Conversely, 
at low degeneracy, any state, including coherent states or thermal states, will be subject to large relative uncertainties 
in photo-counts, but this can be consistently attributed to the quantum nature of the atoms in the detector, because 
exactly the same statistics can be reproduced in a semi-classical theory. 

In many respects coherent states can be thought of as equivalent to a (classical) superposition of a deterministic 
classical field (itself representable as a deterministic superposition of plane waves or other classical modes of the 
optical system in question) and an unavoidable amount of stochastic noise associated with quantum mechanical 
uncertainty. 

Thus, we conclude that classical (but possibly stochastic) sources produce classical-looking (but possibly stochastic) 
radiation, as might be expected after a bit of reflection. 



4- Quasi-Distribution Functions and Quantum Characteristic Functions 



The Glauber-Sudarshan P function is the most natural quasi-distribution function from which to assess the classi- 
cal/quantum correspondence for radiation, but others are useful in different settings. Consider the case of a single 
discrete mode for simplicity, where all subscripts will be suppressed to avoid notational clutter [52]. We can define the 
s-ordered quantum characteristic function [UJ [31] in terms of the von Neumann density matrix p rad as 



X (e;s) = Tr[ Prad D(0]e s 2 



(143) 
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in principle for any sel, but in practice the most useful choices are s = +1,0,-1, naturally corresponding, as we 
will see, to normal-ordering, symmetric-ordering, and anti-normal ordering of the mode operators. As in classical 
statistics, the derivatives of the characteristic function are related to the appropriately ordered expectation values: 

(144) 

where [a^a m ] s denotes the s-ordered product, so that, for example, ([a^a]) = \a'a) + |(1 — s). 

The corresponding quasi-distribution functions are obtained by Fourier transforms, as in the classical case: 

F(a; s) = ^J d 2 £ s)e Q «*-«*«, (145) 

so that s-ordered moments are calculated just like normal moments of ordinary random variables: 

([a n a m ] s ) = J d 2 a a* l a m F(a;s). (146) 

The quasi-distribution functions are always real, but not necessarily non-negative for s > — 1, and may be highly 
singular. Each is is a smoothed version of the preceding via convolution with a Gaussian: 

F(a, s-\) = ljd 2 p F{p; s)e~ 2 ^ 2 . (147) 

The normally-ordered choice s — 1 corresponds to the Glauber-Sudarshan P-function introduced above; the 
symmetrically-ordered choice corresponds to the Wigner Function F(a; 0) = W(a) = \ Tr [L»(a) t nL>(a)p rad ] , where 
IT is a parity operator which flips the sign of the "momentum-like" quadrature component but leaves the "position- like" 
component unchanged; and the anti-normally ordered choice s = —1 corresponds to the nonnegative and nonsingular 
Husimi function. F(£; —1) = Q(a ) = - (a |/?, ad | ot) > 0. 

From the P function, we can directly determine whether the state can exhibit non-classical behavior. The Husimi 
Q function is useful to graphically represent the modal phase space "occupied" by the state, since it is always non- 
negative. The Wigner function corresponds to symmetrically-ordered (or Weyl-ordered) observables which often are 
of theoretical or experimental concern, and is the distribution that people implicitly have in mind when they speak 
of the vacuum fluctuations or zero-point motion as being equivalent to "half a photon" per mode. To see this, note 
that the characteristic functions for a thermal, or blackbody state containing an average of h photons are 

Xn (^ s ) = e^ m+1 - S ^ 2 , (148) 

so if s = 0, we see that in a thermal state, the contribution from vacuum fluctuations (i.e., the part independent of n) 
is exactly equivalent to the addition of half a photon. While obviously the P-function of a coherent state corresponds 
to a Dirac delta function, the corresponding W function for a coherent state is a Gaussian occupying about \h of 
quadrature phase space. 




5. Summary of Field State Prior to Amplification 



In practice, rather than working directly with the resulting density matrix, it will be useful to defer the averaging over 
the classical uncertainty in the particle trajectories until the end of the calculation. That is, because of the linearity 
of the quantum evolution and of any averaging over classical ignorance, we can propagate the conditional state given 
the particle trajectories, apply these fields to the particles in the kicker, and only then average over the initial particle 
distribution function, in order to determine the actual statistics for the beam cooling at each pass. 

This conditional state of the radiation in the coherent wiggler mode, given the particle orbits, will be a pure state, in 
particular a multi-mode |83j Glauber coherent state, such that before any amplification, its expectation value is 



n 



(149) 
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is equal to the corresponding classical solution consisting of the sum of the contributions from all the particles, while 
its variance will scale like 

(\AE( X ,t)f) «§^, (150) 

\ / pic v u 

where V u ~ ttWqN u \q Nu s^° 1S ^ ne original coherent mode volume in real space (not phase space). 



C. Amplification of Pickup Wiggler Radiation 



The exact quantum mechanical evolution of a mixture over such multi-mode coherent states during the amplification 
process will of course depend on precise details of the laser amplifiers, but by considering idealized quantum mechanical 
amplifiers, the best-case statistical properties of the amplified radiation and the minimum amount of added amplifier 
noise can be derived using just a few very basic assumptions about the input-relations. We will follow the very 
illuminating fully quantum treatment of Caves [45 , who extended and clarified pioneering analyses (both semi-classical 
and quantum) in [46j 071 [48j |49l |50j [HJ E21 [53j HH [55] and elsewhere. (See also [561 Hi] an d references therein for 
other insightful modern treatments of quantum amplifier noise) . 

First, it will be supposed that the amplifier output mode field operators depend only linearly on the input mode 
operators, implying that saturation effects in the output are ignored, although it need not be assumed that the 
internal dynamics within the gain medium or the interactions with the pumping or any other supplementary field 
modes are actually fully linear. The amplifier is further assumed to be time-stationary, or frequency-preserving and 
phase-preserving; i.e., an input mode at a given frequency is mapped into an amplified output mode at the same 
frequency (typically just the same mode, extrapolated forward along the paraxial optical path), and if the input is 
shifted in phase by some constant offset, then the output will be shifted by the same amount, so the amplifier has 
no absolute internal clock, nor any preferred phase bias. For transit-time OSC, this time-stationarity is desirable, or 
even essential, because the information used by the cooling mechanism to reduce particle phase space is stored almost 
exclusively in the phases rather than amplitudes of the light, and efficient field/particle coupling in the kicker will 
require that the light frequency remains near the resonant frequency of the particles in the wiggler. 

To develop the needed gain, the amplifier can be multi-stage (cascaded), but must be single-pass, i.e., non-regenerative, 
for two reasons. Firstly, the amount of total particle time-of-flight delay that can be realistically introduced between 
pickup and kicker compared to the proper time between these points (i.e., the time it would take light to travel 
between them in vacuum) is not very great for heavier particles like muons traveling at near- luminal velocities, so 
there simply is not time to circulate the light before injecting it into the kicker coincidentally with the particles that 
radiated it. Secondly, a regenerative amplifier operating at high pump power can easily be pushed over the oscillation 
threshold, where it converts from an amplifier preserving the phase and modulations of the input to a laser oscillating 
very cleanly but at essentially some random phase. 



1. Quantum Mechanics of Linear Optical Amplifiers 



In the Heisenberg picture, where time-evolution is ascribed to the operators, it is supposed that the amplifier output 
mode field operators depend only linearly on the input mode operators The amplifier is further assumed to be time- 
stationary, that is, frequency and phase-preserving. Consistent with the ID dynamics adopted throughout the current 
analysis, it is also taken to preserve transverse coherence, in the sense of maintaining a one-to-one relationship between 
input and output transverse modes. As demanded by quantum mechanics, the time-evolution of the field must be 
unitary, and the field observables must satisfy appropriate commutation relations and Heisenberg uncertainty relations 
at all times before, during, and after amplification. 

It turns out that these basic assumptions about linearity and lack of phase bias, along with the axioms of quantum 
mechanics arc sufficient to characterize the action of and best possibly noise figures for any such linear amplifier, 
without needing to know the specific details of the amplifier system, or explicitly following the evolution of all the 
DOFs in its gain medium. 
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In the Heisenberg picture, we take ag(cu) to represent the continuum field annihilation operator for a mode before 
amplification (i.e., the input mode operator) and bi(u>) to represent the annihilation operator after amplification (i.e., 
the output operator). The modes are specified by a continuum frequency lu as well as some discrete index j meant 
to represent different transverse paraxial modes, or otherwise lift degeneracy. The input modes satisfy the canonical 



bosonic commutation relations ( 121 1 as must the output modes, i.e. 

[b t (w), fc(w')t] = S tff 5(u - J) (151a) 
[6/(w),6//(w') ] =0, (151b) 

consistent with the quantum mechanical evolution remaining unitary. By linearity, we mean that the input and output 
modes are related by affine transformation, so we are disregarding any sort of saturation effects in the gain medium. 
The most general possible linear input-output relation is then 

b t (u) = f duJ ' [Mit>(w,w')a t >(u') +L ul (uu 1 J)a t {Lu i y] + F e (oj) (152) 

Next we demand that the input-output relations are time-stationary, meaning both frequency and phase-preserving. 
By frequency preserving we mean that input at frequency to is mapped to output at frequency u>, while by phase- 
preserving we mean that a phase rotation of the input leads to the same phase rotation of the output, and that if the 
input has time-stationary noise statistics, then so will the output. These are desirable properties for any amplifier to 
be used for transit-time OSC, and together imply that 

Mw (w, w') = M U ' (u)8(u - to') (153a) 
L w {uj,w')=0. (153b) 

Finally, we assume a one-to-one relationship between input and output transverse modes, so that the matrix Mu>(u)) 
is unitarily diagonalizable at any frequency to. Actually, without any loss of generality under our assumptions, and 
to avoid introducing new notation, we may suppose that the an (u>) and bi(uj) have already been chosen in the proper 
linear combination to correspond to the transverse eigenfunctions of the amplifier, so that Mpji{oS) is already diagonal: 

M w (uj) = S e e'M e (cj), (154) 

and the input-output relations for the amplifier become simply 

b t (w) = Mt(u)ae{ui) + F*(w). (155) 

Obviously Mp(uj) is the amplifier transfer function for the jth mode of frequency u>, such that Mi{ui)^Mi (u) = Ge(u>) 
is the power gain for the mode. In general, Mi(lu) can be an operator (g-number) commuting with all input mode 
operators: 

[ai{u) y M v {J)\ = [a t (u)\Mff(u/) ] = 0. (156) 

The additive noise operators Fg(ui) must be present to ensure unitarity. They represent independent degrees-of- 
freedom corresponding to some internal or auxiliary states of the amplifier, so can be taken commute with the input 
and output modes. In fact, in order that the bosonic commutation relations are preserved, we can deduce the 
commutation relations 

[a t (w),F t ,(u/?] = [a t (u),F e (a/) ] - 0, (157a) 
[Ft{u)),F v {tJ) ] =0, (157b) 
[Ft(u),F ff (u/)i] = 5 ee ,6(tu - to') [1 - M e (Lu)M e (Luy] , (157c) 

for the noise operators Fe(u). 

To complete the description of the amplifier, we must specify an initial state (density operator) for the scaling and 
noise operators. In the Heisenberg picture, time evolution resides in the operators, so this operating state is fixed, 
and should be independent of any possible input signal. That is, the density matrix of the total system (modes to be 
amplified plus all auxiliary modes internal DOFs of the amplifier) factorizes: po = p a <£> Paw 



No fundamental physical principle imposes any lower bound on the variances and covariances of the M((u), so in 
the ideal case they can be taken to be c-numbers, implying no multiplicative quantum noise in the output, that 



38 



could arise from variations in the gain via fluctuations in the pump strength, for example. Besides, such noise 
can easily be added back later, if needed. With Gi{ui) = \M^{uj)^ > 1 and deterministic, note that we can write 
Fe(u>) — y/Ge(u>) — 1 ft(u>)\ where fe(u>) is a bosonic annihilation operator for what may be interpreted as the effective 
internal amplifier mode that contributes the extra noise. Nothing prevents an assumption excluding bias in the noise, 
such that (Fi(lo)) — 0, and in fact our previous assumptions about phase preservation require it. 

But the commutation relations together with a generalized Hciscnberg Uncertainty Principle applicable to creation 
and annihilation operators constrain the second-order noise statistics. For the symmetrized covariances, we find the 
inequality 

\{Fz(ufF t ,{J) + F t ,{u')Ft{^)-{Fz(uf){F it (J)) > \ \ [F t (u), F*(u/)t] | (158) 
In particular, for coherent-state input, the output noise statistics, in terms of symmetrized covariances become, 

(&*(£«;)&</ (u/)) - (M"0) (^'(w')> = °; ( 159 ) 

and 

G l {u)& jj ,6{u-<J) [\ + St{u))] . 

The first term on the right-hand side represents the amplified "vacuum fluctuations" |84j present in the input, 
equivalent in noise to "half a photon per unit bandwidth" before amplification, while the second term represents 
the extra noise added by the amplifier (typically through amplified spontaneous emission in the gain medium), in 
which the added-noise spectral density St(ui) must satisfy: 

^H = i|i-e^|°?M> (161) 

where o\(u) > 1, with equality in the Heisenberg- limited case, so at large gain the added noise power is the equivalent 
of at least one additional "half photon" per mode of input to the amplifier. 

For a single mode, note that the fluctuations in photon number in the output are super-Poissonian. If the input is 
coherent, the Fano factor starts off as unity: ^^TT = 1 b u t a t output, becomes ~G+1~G^>1, where 

here N a = a) a and N b = tfb. The output state is chaotic, not coherent. 



D. Quasi-Distribution Functions 



For simplicity, we will temporarily drop all the indices and arguments and for the moment treat a single discrete input 
mode a, output mode b and internal amplifier mode /, assuming a gain factor G » 1. The multi-mode results are 
exactly analogous, only the notation is quite cumbersome. While we worked with symmetrically-ordered operators 
above, it is more convenient here to begin with normally-ordered characteristic functions and the corresponding 
Glauber-Sudarshan quasi-distribution function. 

We assume the input mode a is in a Glauber coherent state. Actually, it should be described in some statistical 
mixture of Glauber states, weighted by the particle distribution function, but again, that averaging will be deferred 
until later - in effect, we are here conditioning on supposed knowledge of the classical particle trajectories in the 
pickup wiggler. 

Recalling that the combined density matrix for the input and amplifier factorize, and the input and internal operators 
all commute, the normally-ordered characteristic function of the output mode is then 
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Now, for a coherent state input, it is easy to see that Xa(M*£, 1) = XMa(£, 1), i-e., the first term is the characteristic 
function for a deterministically scaled coherent state with expectation (Ma) = M (a) . The second term on the 
RHS is recognized as the normally-ordered characteristic function for a thermal (chaotic) state with an average of 
Af n = G — 1 photons. The final term is the characteristic function for the internal mode operator / after scaling by 
\/G — 1. Considerations of the central limit theorem or of maximum entropy suggest that the added noise appearing 
through the initial state of the / mode should, in the absence of further specific information, also be taken as thermal: 
1) = e - ^^! , where (/) = (/ 2 ) = • ■ ■ = 0, and we have defined Aff = (flf) > 0, so the characteristic function 
for the amplified field mode becomes 

X & (C,l)=XMa(e,l)e- ( ^ +1)(G - 1)l « |2 (163) 

According to the Fourier convolution theorem, we see that after amplification, the resulting Glauber-Sudarshan P- 
function will be a displaced chaotic state, or equivalently the sum of a multiple of the input coherent state and an 
independent additive chaotic noise term corresponding to about (Mf + 1) (l — thermal photons at input, where 



Nf —> in the ideal limit. It is easy to verify that the Fourier transform of (1631, i.e., the P-function, is nonnegative 
and normalizable — in fact it is just a Gaussian, centered on the appropriate multiple of the input state, and with a 
width characteristic of a thermal state. 



1. A Heuristic (Single-Mode) Picture 



Any one particle interacts in the kicker with radiation from itself particles within a coherence length, i.e., with radiation 
that is "almost' confined to one mode. Therefore, while keeping in mind that the quantum state of the radiation is not 
actually diagonal in any number state basis and that a more careful multi-mode analysis is strictly necessary because 
the spectral bandwidth is not vanishingly small and the effective number of longitudinal modes in the entire radiation 
field exceeds unity, we can nevertheless paint the following suggestive picture: before amplification, the radiation field 
from the pickup wiggler that would be seen by any one particle "consists" of, on average, the equivalent of: 0(a) 
self-radiated photons constituting the cooling signal; \ photon of "vacuum fluctuations," and 0(N s a) > 1 photons 
radiated from other particles in the sample. The relevant SNR is approximately aN ^372 ■ 

After amplification in an ideal, quantum mechanical phase-preserving linear amplifier, with overall gain factor G» 1, 
the field seen by the same particle is essentially classical but stochastic, containing the equivalent of 0(Ga) photons 
with proper phasing on average for cooling, ^G randomly-phased photons from the amplified vacuum fluctuations, 
at least another photons from the noise added by the internal amplifier mode or modes, and 0(GN s a) from 
amplification of the other particle's radiation, with approximately 0(G) photons at each of about 0(N s a) different 
random phases. The SNR becomes approximately ajv " +1 , comparable to, but slightly worse than, the pre-amplified 
SNR. 

Quantum amplifiers do not improve the SNR, but ideally amplify the input without serious degradation of the SNR, 
up to some sufficiently high level, where the field can be be measured, manipulated, or employed by course-grained 
classical means without any significant further loss of SNR by quantum back-action. But the essential point is that 
neither the pre-amplified nor post-amplified SNR is "catastrophically" affected by quantum mechanical fluctuations or 
uncertainties inherent in emission, or amplification, through multiplicative noise, but rather the effects are just roughly 
equivalent to having an extra 0(a~ l ) particles in the sample. Of course, this equivalence to extra sample particles 
is not precisely true, because the noise photons added by the amplifier will not have exactly the same spectrum or 
quasi-distribution function as the photons actually radiated in the wiggler, but if N s > O(10), the Central Limit 
Theorem suggests that the net field from all the other particles in the sample will also be close to a Gaussian chaotic 
state, as will the sum of the sample-particle and noise contributions. 



2. The Bottom Line on Quantum Amplification and the Amplified Field 



Conventional linear phase-preserving amplifiers do not first measure then multiply photon number states, but instead 
linearly amplify the entire input field, while unavoidably adding additional noise. 
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In each mode, the additive noise is equivalent at input to the original vacuum fluctuations, themselves equivalent to 
about "half a photon," plus another independent set of vacuum fluctuations, another "half a photon." 

As Caves [43] aptly put it, "quantum mechanics extracts its due twice." After amplification in a high-gain amplifier, 
the two non-commuting quadrature components of a field mode may be measured classically with little further 
degradation in the SNR. But simultaneous measurement of non-commuting variables must involve auxiliary variables 
that do commute, and whose uncertainties add to the uncertainty already present |85j . 

Once linearity is demanded of the amplifier, it must amplify input vacuum fluctuations along with both quadrature 
components of any signal, and indeed cannot distinguish between what we, the cooling system, or the beam particles 
might consider signal versus noise. There are of course states of the combined system consisting of the input fields 
modes and the amplifier for which the additional noise in the form of a second set of vacuum fluctuations is not present 
in the output, but they will necessarily involve correlations between the input and internal modes, but because any 
internal amplifier modes must be prepared independently of the input, their zero-point motion will be amplified and 
added to the output (physically, ultimately through the mechanism of spontaneous emission.) 

But apart from this doubled additive noise, quantum mechanics does not require anything like the speculated multi- 
plicative Poissonian noise due to "quantum jumps" that we found so deleterious in our naive model. 

Despite naive fears to the contrary, the signal does contain the needed phase information, albeit corrupted by a large 
amount of noise which, however, is random in phase (unbiased), independent of the input state, and purely additive. 

The noise added by the amplifier can be characterized by an effective temperature assuming it roughly thermal in 
distribution, or as an equivalent number of photons at input, or perhaps most approximately but most usefully, as an 
equivalent number of extra particles in the sample. Because the cooling signal in the coherent portion of the wiggler 
radiation spectrum was approximately 0(a~ 1 ) photons on average before amplification, while the total noise power 
at input is about O(l) photons, the amplifier noise is expected to be the equivalent of approximately N n ~ 0(a^ 1 ) 
extra particles in the sample. While not nearly as detrimental to cooling as the noise predicted from the quantum 
jump model, this will eventually saturate improvements expected from further beam stretching, and limit the ultimate 
rates of cooling and the equilibrium emittances and energy spreads achievable. 

When the gain is large, the radiation density matrix for the amplified field, conditioned on a given set of particle 
trajectories, is described as a biased chaotic state, essentially indistinguishable from a classical but stochastic light 
field consisting of the superposition of the deterministically-scalcd classical pickup radiation, together with some 
admixture of chaotic light characterized by a certain noise temperature proportional to the overall gain. Because the 
Glauber-Sudarshan P function of such a displaced chaotic state is everywhere nonnegative and well-behaved, the field 
can be interpreted classically. 

In fact, the post-amplified field values can be taken as Gaussian c- number random variables with means 

(E(x,t)) post *VG{E(x,t)) pxB (164) 

and variances 

(\AE(x, t)\ 2 )^ a G (\AE(x, t)\ 2 ) ^ (165) 

so the SNR is approximately preserved. Calculation of the cooling dynamics can hereafter proceed classically, only 
with the added stochastic contributions from the quantum vacuum fluctuations and the amplifier noise included. 



XII. COOLING DYNAMICS FOR A SIMPLIFIED MODEL 



After using the hemi-classical radiation model and this very general formalism for linear amplifiers, determination of 
the interaction of the particles and amplified optical field within the kicker and calculation of the resulting statistics 
for cooling kicks on each pass can proceed essentially classically. Because we are here primarily interested in questions 
of principle, and endeavor to demonstrate that quantum effects properly considered will not catastrophically affect 
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cooling rates, the cooling statistics can be worked out explicitly, albeit for a highly idealized, one-dimensional model, 
wherein we make a number of simplifying assumptions in order to achieve a greater measure of tractability. 

The average beam energy 70 3> 1 is assumed highly relativistic, and the energy deviations Sjj = jj — 70 are assumed 

sufficiently small, so that \5jj\ -C 70, and \jj — 7^1 <C 70, and also — vq\ <C c, where vq = c/3o w c^Jl — ^ 

is the mean longitudinal velocity outside the wigglers. All particles then approximately have the same longitudinal 
velocity and the same magnitude of transverse quiver momentum inside the wigglers, and their trajectories can be 
be determined from conservation of energy and of transverse canonical momentum. Only longitudinal cooling is 
considered here; the possibility of cooling of transverse DOFs, by making part of the time-of- flight delay proportional 
to betatron errors, and then exploiting dispersion in the lattice, is here omitted for simplicity; the calculation is similar 
if somewhat more involved. Because the beam is highly relativistic, the change in longitudinal momentum is related 
to the energy kick by 

Ptj ~ Po ~ 70 ~ 1j 

where po ~ mc-f /3o is the average longitudinal momenta of the beam particles. 

In the kicker, the wiggler fields are much stronger than the amplified radiation fields from the pickup and vastly 
stronger than any newly-emitted spontaneous radiation. So just classical wiggler fields together with the particle initial 
conditions at the wiggler entrance are assumed to determine particle spatial trajectories, with negligible perturbations 
from the radiation field or space-charge fields. Both wiggler fields are assumed to be planar and perfectly transverse, 
with no transverse variation over the extent of the beam, and exactly sinusoidal in their longitudinal variation, 
requiring \ u » a± and both satisfied with orders-of-magnitude to spare. That is, the wiggler magnetic fields 

in both kicker and pickup are assumed to be of the form B u (x) ~ ysm(k u (z — z u )) over the extent z u < z < z + N u X u 
of each wiggler, with fringe fields neglected. 

Off-axis effects and all transverse motion of particles are ignored accept the idealized ID quiver motion in the wiggler 
field, as determined by canonical momentum conservation. Higher harmonics of particle motion the occur in planar 
wiggler are ignored, and the time-averaged longitudinal components of the unperturbed orbits are deduced from the 
transverse quiver and conservation of kinetic energy in a magnetic field. The effects of space-charge forces and any 
plasma waves or other collective effects on the beam distribution are expected to be negligible and are ignored. Any 
external focusing is ignored as well, although in practice these forces may not be small. 

Specifically, with the effects of energy variation and possible off-axis injection ignored, the transverse motion for 
each particle inside the wiggler is determined within our approximations by the conservation of transverse canonical 
momentum conservation in the wiggler field only: 

(3 ±j (z) w ^-^a u sm(k u (z- z u )) for z u < z < z u + N U X U , (167) 

where we can use 70 rather than 7^ because the errors will be of higher order than terms retained in our approximate 
calculation. 

Because variations in longitudinal velocity between different particles are neglected, and the fast oscillating component 
of this velocity due to motion in a planar wiggler is ignored, for the longitudinal motion one finds 

/M*) ^OT+AoTA^ (168) 

so that 1 w ^ -C 1, and the time spent in the kicker wiggler is then Tj w N ^ for any particle. 

Spectral broadening or other effects on the spontaneous emission spectra from the pickup, due to off-axis injection 
or the angular deviations associated with the transverse quiver motion are ignored, and the expected (i.e., classical) 
emission from each particle is assumed to consist of exactly N u plane-polarized, sinusoidal oscillations at central 
frequency luq, emitted on-axis, with amplitude Eq, with expected phase determined by the random longitudinal 
position of the particle in the beam, and extending transversely over some spot size wq larger than the transverse 
beam size and the extent of a single particle's quiver orbit, i.e., for which w Q > a h ^ and w > At best, the 

latter conditions are probably only marginally satisfied with the present parameters, but for simplicity we do neglect 
for now any transverse structure or imperfect transverse coherence in the radiation fields. 



(166) 
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A single planc-polarizcd fundamental transverse (say £ = 1) mode of the pickup radiation is assumed to have been 
amplified by an ideal laser with a flat gain G > 1 profile and negligible linear or nonlinear dispersion over the 
bandwidth Au; L > Auj. 

Outside the amplifier, all radiation is assumed to propagate purely longitudinally according to the vacuum dispersion 
relation. After amplification, the field is stochastic due both to the original beam shot noise and the added amplifier 
noise, but is essentially classical in its statistics and dynamic behavior, consistent with hemi-classical quantum optics 
detailed above. 

The bypass lattice is assumed to introduce a time-of-flight variation linearly proportional to energy deviation 5jj = 
jj — 7o- The energy change in the kicker is calculated to leading order in - using the unperturbed quiver orbit of the 
particle in the kicker field. Any particular cooling kick on a single pass for a single particle will also be relatively 
small compared to the particle's total kinetic energy, i.e., |A7j| -C 7o — 1, but not necessarily very small compared to 
the energy deviation Sjj. 

The bunch as a whole is assumed to contain many particles, Nb 3> 1, and after stretching the beam density pb is 
assumed to be uniform over the entire stretched length, which is presumed very long compared to the sample length, 
i.e., Lb » N u \ n , so that the relative size of any sample is small: N s <C Nb. (A Gaussian bunch with a b \\ ~ Lb is also 
analytically tractable). Each sample will still contain a moderate number of particles, typically N s ~ O(10 2 ). 

Nearby particles will experience similar kicks and will become partially correlated, but if the sample sizes are small 
in relation to the beam as a whole, this correlation effect is negligible when averaged over all particles in the beam. 
So if good mixing between cooling kicks is assumed, so the noise effects in a given particle's kicks can be taken to be 
statistically independent from kick to kick, and the full kicks (including coherent and incoherent parts) can be taken 
to be effectively uncorrelated for different particles, then only the single-particle, reduced distribution function at one 
moment of time need be tracked. In fact, the energy distribution will be assumed to start and remain Gaussian, so 
that only first-order and second-order moments need by retained. 

Many of these assumptions are fully justifiable in our parameter regime, while others are rather drastic and will 
eventually need to be relaxed in a more realistic calculation, but should be adequate for our present goal, a proof-of- 
principle that quantum noise is no deal-breaker for the scheme. 



A. Bunch Stretching and Compression 



We model the initial bunch stretching and final bunch re-compression very simply, as symplectic maps which ultimately 
leave the transverse phase space distribution unchanged, while preserving the action of the longitudinal phase space. 

Recall that the stretching is to be accomplished by drift in a ring with a high-momentum-compaction-factor, in 
order to disperse the beam based on energy spread, followed by use of a LINAC to remove the resulting head-to-tail 
correlation between longitudinal position and energy. After cooling, the process is reversed to re-compress the beam, 
by first applying an appropriate energy chirp with the LINAC then allowing the beam to drift in the compaction ring, 
where now particles in the tail will tend to catch up to those in the head of the beam. Obviously, the transverse phase 
space cannot remain unaffected or even uncorrelated with the longitudinal phase space at intermediate times, during 
the actual stretching or compression, because particles with greater longitudinal momenta are being forced onto longer 
orbits by highly-dispersive beam optics, but it can hold to a reasonably good approximation after the expansion or 
contraction is completed. We assume that the stretching/compression ring is sufficiently long to avoid "wrap-around" 
effects, which will lead to striations in longitudinal phase space. The dispersive drift shears the occupied phase space 
horizontally, while the ramped kick from the LINAC can shear it more or less vertically. Assuming all the head-to-tail 
energy variation is removed by the LINAC, and correlations between transverse and longitudinal DOFs that build 
up at intermediate times are undone by proper lattice design, then from Liouville's theorem, supposing % and t a are 
some times before and after any stretching, then 

L b (t a )5j(t a ) w L 6 (t 6 )<f 7 (t 6 ). (169) 

If the bunch length satisfies Lbt a ^> Lj,(t(,), then the energy spread satisfies 5j(t a ) <C 6j(tb). Not only does stretching 
lower the beam density n b and the resulting sample size N s , but it also (reversibly) reduces the range of energy 
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variation S"f, which relaxes design constraints on the bypass lattice. If subsequent OSC then (irreversibly) reduces 
the energy spread by some factor jj, by time t c without affecting the overall bunch length, i.e., Sj(t c ) sa fiSj(t a ) while 
Lb{t c ) w L h (t a ), and then the bunch is re-compressed to its original length by the final time tf by reversing the 
stretching procedure, then 

*7(*/W7(*c)2^=^7(t6), (170) 

so the final energy spread after cooling and re-expansion remains reduced by the same factor \i compared to the initial 
energy spread of the incident (uncompressed) beam. 

The stretching/compression phase does raise some non-trivial practical issues that appear to have been overlooked. 
If N b ~ O(10 9 ) and L b (t a ) ~ C^ICT 1 m), then to get down to sample sizes of N s ~ O(10 2 ) for N u ~ O(10) and 
Ao ~ O(10 -6 m) requires a stretching factor of L b {tb) / 'Lb(t a ) ~ O(10 3 ). But the time required for a relativistic beam 
to traverse a compaction ring of circumference Co > LbiH) ~ O(10 2 m) is already about t ~ O(10 6 s). The problem 
is that if we try to damp for several e-foldings such that \i = O(10 -2 ), then the when it comes time to re-compress 
the bunch after cooling, we naively expect the compression to take 0(fi~ 1 ) ~ O(10 2 ) longer in the same lattice 
(with the same dispersion function), because the energy deviations are proportionally smaller. So the stretching and 
compression is not entirely trivial, and, unfortunately, perhaps different rings must be used for the compression and 
re-stretching, but we leave lingering questions for another day. 



B. Energy Kick Statistics for Individual Particles 



Within this simple model, the first-order and second-order statistics of the cooling kicks for individual particles 
can be worked out analytically by lengthy but straightforward calculations, at least to leading order in the various 
small quantities, so we will summarize the results without presenting the unilluminating mathematical details and 
intermediate steps. 



1. First- Order Statistics 



Within the kicker, recall that the effects of both beam self-fields and any newly-emitted spontaneous radiation can 
be assumed negligible compared to those of the wiggler fields and amplified radiation from the pickup wigglcr. The 
latter fields, while highly amplified, still have a small effect on the actual spatial trajectories followed by the particles, 
as compared to the effect from the static wiggler field, so each single-particle orbit can be assumed to be determined 
by initial conditions at the entrance to the kicker and by the Lorentz forces produced by the wiggler field only. Of 
course, the whole point of the kicker interaction is to change the energy of the particle, and any such energy kick must 
arise from work done by a component of the optical electric field E(x, t) in the kicker parallel to the particle velocity. 

The total energy change inside the kicker for the jth particle during a given pass through the cooling section can then 
be estimated as: 

tj+Atj 

where q is the (signed) charge of the particle; Xj(t) is the corresponding particle trajectory inside the wigglcr, and 
Vj(t) — 4iXj(t) = v Zj (t)z + v±. (t) is the particle velocity, both of which can be approximated assuming the dynamics 
are governed only by the wiggler field; E{x 1 t) is the electric field of the amplified pick-up radiation as it propagates 
through the second wiggler; tj was the arrival time of the particle at the entrance of the pickup wiggler; At, is the 
total time delay between arrival at the first wiggler and arrival at the second wiggler, depending on the time-of-flight 
delay introduced in the bypass lattice; and Tj is the duration of time spent in the second wigglcr. That is, the energy 
kick due to the interaction of the optical electric field with the transversely quivering particle can be calculated in a 
perturbative fashion, in which to leading-order the effects of the changing energy on the particle orbit arc neglected. 
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Neglecting various small terms, the conditional average energy kick for the jth particle, given values for the reference 
energy and of its own pre-kick energy deviation, but averaged over all intrinsic quantum mechanical uncertainty in 
the radiation as well as over the classical shot noise, i.e., random arrival times of other particles in the its sample, is 
given by 
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where 6(s) is the Heaviside step function, <f>jj — 4>jj(tj,jj,jo) is the phase of the jth particle's amplified self-field as 
seen by that same particle at the entrance to the kicker, while tj is the time of arrival of particle j at the pickup. 

Note that 4>jj is determined by the particle's time-of-flight through the pickup wiggler and bypass lattice compared to 
the delay of the pickup radiation during propagation and amplification as well as any further phase-shift introduced by 
the laser amplifier system. If the particle delay in the bypass lattice is arranged such that for sufficiently small energy 
deviations 5jj, the relative delay leads to gsin(0j) oc — 5jj + 0(5jj), then the particle will experience a restoring 
force on average, tending to push the energy back toward that of the reference orbit. Ideally, the probable range of 
variation in Sjj before the cooling kick is mapped into just ± ^ of phase delay, so that the magnitude of the damping 
force grows monotonically with the field gain vG and the magnitude |<!>7j| of the energy deviation. 



2. Second- Order Statistics 



After some more algebra, the corresponding conditional expectation of becomes approximately: 
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where &l = \N U ^ L - is a measure of the amplifier bandwidth (FWHM) relative to the coherent bandwidth of wiggler 



radiation, and is 0(1), while Si(a;) = J du sm ^> is the sine integral. Note that Si{—x) — — Si(x), and for |x| < 



Si(a;) 



approaching Si(a;) 



Ygz 3 , while for x > Si(a;) undergoes decaying oscillat ions, peaking at Si(7r) w 1.852 and asymptotically 



as x — * oo. The first term on the RHS of (173) is due entirely to self-fields, and is just the 



square of the first-order conditional average; the second term is the contribution from shot noise, reflecting heating 
from other particles within the sample, while the final terms reflect the minimal additive amplifier noise consistent 
with quantum mechanics. 

In principle, the conditional two-particle correlations (AjjAjk 1 77,70) are also needed to determine the evolution of 
the beam energy spread, but with good mixing, the effect of these cross-terms will be negligible when averaged over 
all the particles in the beam, so we need not actually calculate them explicitly. 



C. Prom Single Particle Statistics to Beam Properties: 
Evolution of the RMS Beam Energy and Energy Spread 



Because the SNR is so low, the magnitude and even direction of any given particle's energy kick on any one pass are 
subject to large uncertainty, and cannot be estimated reliably. In conventional RF stochastic cooling, relatively small 
kicks delivered over many passes lead via a time-averaging or smoothing procedure to a continuous-time, Fokker- 
Planck equation describing the evolution of the single particle distribution function. Any one kick for any one particle 
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cannot be predicted with much precision, but the cumulative effect of many kicks enjoys the usual advantages of the 
law of large numbers, and can be reliably determined. 

In ultra-fast OSC, the beam must be cooled with a relatively small number of relatively large kicks, so time-averaging 
is less justified, and less effective at improving predictability. However, particles phase space deviations remain largely 
uncorrelated during the cooling, so while prediction of individual particle behavior is unreliable, averaging over all 

particles in the beam is expected to give o(^N b 1//2 ^ improvement in predictive accuracy. 

In analyzing the stochastic evolution of beam properties, we should be careful to distinguish between arithmetic 
means over all beam particles, so as to define intensive beam properties such as the average density, velocity, energy, 
or energy spread, or (classical and/or quantum mechanical) statistical or ensemble averages, or expectation values, 
over our uncertainty in cither individual particle or collective observables. 

If t- is some time just before the beam enters the cooling section, at which point the average perp-particle energy is 
70, and t+ is a time just after it has passed through the cooler but before it reaches the next cooling section, then 



the net change in the mean particle energy can be predicted by further averaging the conditional averages ( 172 1 over 
the energy distribution for individual particles and over all particles in the bunch. The mean energy per particle of 
all beam particles beam at any time t is 

7(*) = F E^)' ( 174 ) 

which remains a "random variable." Since all the beam particles are assumed identical, The statistical expectation 
value of this mean per-particle energy after a cooling pass is 

(7(M) = (7j(<+)) = (7i(*-)) + (A7i) , (175) 

where 

(A 7 ,-) = «A 7i |7 7 -,7o)) (176) 



is the conditional average (172) averaged further with respect to the remaining random variables, i.e., with respect to 
the probability distribution for pre-kick particle energies. Ideally, the cooling will not change the mean beam energy 
appreciably but will only redistribute energy from more energetic to less energetic particles: 

(7(*+)> = <7i(*+)>«<7 J -(*-))=7o. (177) 

In order to determine cither the expected decrease in energy spread due to cooling, or the uncertainty in our prediction 
(standard error) for the per-particle beam energy, second-order statistics for (At,) 2 are also needed, but these two 
quantities should not be conflated: our uncertainties and intra-beam fluctuations are not the same thing. Since statis- 
tics for the different beam particles are approximately independent, we expect that the standard error of prediction 
for 7(f) will be 0(—/=) better than that for any one particle. 

Evolution of the mean per-particle energy has nothing to do with actual cooling per se: beam slowing is not the same 
as beam cooling. To assess the latter, we need to consider second moments. 

First consider the unconditional variance of the jth particle's energy, defined by: 

a](t) = Var[ 7j -(i)] = ([ 7j -(*) - (^(i))] 2 ) , (178) 

so that 

^(i+) = o?(t_) + [«(A 7j ) 2 |7i,7o»-<A7i) 2 ] {m) 
+ 2 [«A 7j - | 7j , 7o ) <y 7i (t_)) - (A 7i ) (* 7i (t_)>] 

If (A 7 j) w 0, then this can be rearranged and simplified to yield: 

a*(t + ) - <r?(t_) = 2 (A 7 ^ 7i (t_)) + <(A 7i ) 2 ) , (180) 
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where ((Ajj) 2 ) = (((At^) 2 7j,7o/)j and (Ajj5jj(t_)) — ((A-fj \ jj, 70) 5jj(t_)}. S ince for small energy deviations, 
the kick will satisfy Ajj ~ — Sjj by design, the first term on the right-hand side of (180 1 is negative and reflects the 



impact (on our uncertainty, not necessarily physically, on the particles) of cooling due to the coherent self-interaction, 
while the second term is always positive and represents the effects of heating. However, it is important to note that 
aj(t + ) actually quantifies changes in uncertainty in the jth particle's energy after being subjected to the stochastic 
kick, and is not equal to the particle's actual but unknown energy deviation Sjj(tf) or to the unknown change in this 
energy via the kick, although it is numerically equal to the expectation value of the squared physical energy deviation. 

But in the present context, cooling means reducing the actual extent of energy deviations amongst the beam particles, 
not reducing our uncertainty in what the particle energies might be, so ( 180 I is not necessarily equal to the real quantity 
of interest for cooling. 

Longitudinal cooling can be assessed by changes in some measure of the actual fluctuations about the mean energy, 
such as the RMS energy spread Sj(t) of particles in the bunch at time t, defined by: 



6j(t) 



N b 

i£(7^)-7(*)f 



1/2 



(181) 



Because the kicks are stochastic, the S^j(t + ) 2 , and hence 6j(t) cannot be known exactly, but the RMS expectation 
value of the latter can, and is approximately given by: 



[5^t)] 2 )^a 2 (t). 



(182) 



That is, on average, the RMS energy spread of the bunch is approximately equal in value to the single-particle energy 
variance, as might be expected from a bunch comprised of particles which are assumed statistically independent. But 
while the magnitude and direction of any one particle's kick are subject to large uncertainty and cannot be estimated 
reliably, averaging over all the beam particles ensures that ( 182 I will be quite accurate as a point estimate for the 
overall RMS energy spread within the beam. 

The exact calculation of this standard error would require knowledge of still higher-order moments, but assuming a 
Gaussian moment closure (justified by the Central Limit Theorem under our present assumptions), we can obtain the 

estimate Var [(57(t+) 2 ] ~ O ( jf^J °f (*+)) so the relative uncertainty of our estimate of the RMS energy fluctuations 
in the beam is expected to be O (^jj^ j ^ 1- 



Defining the instantaneous cooling rate r c 1 for energy spread as: 



2 dt 



(183) 



a formal expression for the approximate energy cooling rate averaged over the current cooling pass can be found: 
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where again /o is the frequency of cooling kicks experienced by the bunch. Since Ajj 
of the form of equation ([I]) . 



G, Note that this is exactly 



If the particle energy distribution is assumed to start and remain Gaussian, and the time-of-flight delays are assumed 



to be linear functions of the energy deviations, then (184) can be calculated analytically. As suming optimal choices 



for gain and time-of-flight delays, the expressions become essentially identical in form to (18), with N„ « ^j^N u \o, 
and with N n assuming a quite complicated explicit form, but for which typically N n ~ 0(a~ 1 ). in practice. If the 
Gaussian approximation is not valid, then the distribution functions can be evolved numerically, but in this case 
higher-order moments may be needed, which are increasingly cumbersome to calculate. 
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The cooling rate is essentially that predicted classically, except for some unavoidable noise from the quantum amplifier 
which appears additively, roughly as the equivalent of an extra 0{a^ 1 ) or so particles in the sample, not as a 



multiplicative degradation as in the rate (19) in the Poissonian emission model, or worse, as in (21) with both 
quantum jumps and Heisenberg phase noise. 

To estimate the uncertainty in this predicted cooling rate, we would need to know fourth-order moments of the 
energy kicks and energy deviations, which can be laboriously worked out if needed, but again assuming Gaussian 
distributions, the estimated rate can be shown to be statistically reliable as a characteristic of the behavior of the 

beam as a whole while undergoing cooling, because of the now familiar O \ ~^° \ scaling. 



However, note that ( 184 1 should not necessarily be interpreted as the reciprocal of a true exponential decay time, 
because the continuum approximation needed is not fully justified in the ultra-fast OSC regime. With traditional, 
slower stochastic cooling methods, typically the effects of many (O(10 3 )) kicks must be accumulated before any 
appreciable reduction in phase space occurs whatsoever, so over such long times scales (compared to /q -1 ), cooling 
kicks can be assumed to be effectively smeared out more or less continuously in time, and the dynamics can be 
approximated by a Focker-Planck type differential equation. Here, each individual kick is much bigger, but fewer are 
imposed, and accurate cooling estimates really require evaluation of the actual discrete-time difference equations. 



D. Final Cooling in Muon Accelerator 



With the classical model for OSC essentially confirmed, apart from some lower bound established on the additional 
additive amplifier noise that has its origins in quantum mechanics, rough estimates for cooling times achievable with 
given power, or for the power required to achieve given cooling rates, can be determined. The demands of microsecond- 
scale cooling exceed present technology, but slower cooling would require more modest systems. Because our rough 
estimates agree with those presented in [5, 6J, we just quote the results. 

Cooling a7~ O(10 3 ) beam with A& ~ O(10 9 ) particles per bunch, on a time-scale t c ~ O(10 -7 s) in order to achieve 
an O(10 -3 ) relative decrease in longitudinal emittance would require stretching to a size where N s ~ O(10 2 ), and 
would require about O(10) lasers amplifiers, each producing O(10 2 W) of average power at O(10 2 Hz) repetition rate, 
which is beyond the current capabilities of existing amplifiers, but perhaps achievable in the next few decades. 



E. Some Obvious Limitations and Extensions 



1. Transverse Cooling 



Although for simplicity we do not treat transverse cooling here, we should point out that it is possible, and in fact 
can be performed simultaneously with longitudinal cooling, at the expense of a certain slow-down in the cooling rates 
if the same cooling sections are used for both longitudinal and transverse DOFs. 

As mentioned above, the trick is to exploit dispersion. For sufficiently small deviations from the desired reference 
orbit, the full trajectory of a transverse coordinate (say the x coordinate) for a particular particle in the kicker is 
given by: 

xj(z) = x o (z) + x 0j (z) + ^-D x (z), (185a) 
x'^z) = x' o (z) + x' .(z) + 5 -%D' x {z), (185b) 

where: xq(z) is the reference orbit in the cooling section (including the expected quiver motion); xpAz) represents 
the typically slowly-oscillating (compared to the quiver) betatron orbit (transverse deviations from the reference orbit 
in the absence of dispersion) , determined by the beam optics of the external transport lattice and any focusing fields 
associated with the wiggler itself, together with the particle's initial conditions; — is the relative energy deviation 
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from the reference orbit; and D x (z) is a dispersion function depending on the lattice and the insertion device (i.e., 
kicker wiggler), reflecting the fact that particles with different energies arc bent differently in magnetic fields, while 
here primes indicate derivatives with respect to the longitudinal position, which is just taken as z in the straight 
kicker section. 

If the delay in the bypass between pickup and kicker is made proportional (in part) to the betatron amplitude, i.e., 
with a component proportional to either xjp or x'- p, then the resulting energy kick Ajj will also depend on these 
errors in the transverse orbit. If the kicker is located in a region of appreciable dispersion, then as Sjj changes due to 
work done by the transverse fields of the amplified radiation on the particle, as it quivers in the kicker, Xjp(z) and 
x'jp(z) must also change in compensation, since x n (z) and x' (z) are fixed, and the overall transverse position Xj(z) 
and angle x'j(z) must remain continuous. If the phasing is arranged properly, in this way the particle can be nudged 
onto a smaller-amplitude betatron orbit, which will be apparent when it emerges into a region of smaller dispersion. 



2. Neglected Classical Optical Effects 



For the proof-of-principle argument here, we have assumed flat amplitude gain, and ID radiation fields propagating 
in vacuum without diffraction or dispersion. Since the structure and phase of these fields is essential to the success 
of the transit-time OSC scheme, more careful calculations will eventually be needed to make tighter quantitative 
predictions. 



a. Dispersion 

Since any cooling depends crucially on careful phasing, any dispersion in the amplifier or the optical transport system 
will diminish the effectiveness of the OSC. The times-of-flight can of course be adjusted to compensate for any overall 
average delay for the frequency band, linear group- velocity dispersion and any nonlinear phase-modulation within 
the amplifier are expected to be deleterious to some degree. Such dispersion can come from the usual linear indcx- 
of-refraction of the medium or the optics, or from the phase shifts that are proportional to the gain (amount of 
stimulated emission in the medium), which in practice is frequency-dependent. 

b. Diffraction/Transverse Effects 

We have here effectively assumed that the amplified wiggler radiation consists of transverse plane waves, but even for 
well-collimated particle beams where a± <C Wo, certain paraxial effects may become quite important. 

Recall that the transverse polarization component of a fundamental Gaussian vacuum paraxial mode, assumed to be 
plane polarized and propagating along the z direction with focus located at z = 0, can be written as: 

E X ( X , y, z; t) = E ^ -^^^ [ifr + afej] , (186) 

where E is the overall complex amplitude, k is the carrier wavenumber, ivq — ck is the carrier frequency, r = 
yjx 2 + y 2 is the transverse distance off-axis, w(z) = w 1 + is the local waist, R(z) = z 1 + (4f) 2 is 

the radius of wavcfront curvature, n(z) — arctan is the Guoy phase, and Z Ti = \k w 2 is the Raleigh range, 

where wq is the waist size at focus. 



Issues of coupling efficiency suggest that both kicker and pickup wigglers should have identical values for \ u and 
(nearly) identical values for N u , while the Raleigh range of the amplified radiation in the kicker wiggler should be 
comparable to that of the radiation emitted in the pickup, by proper choice of collection and transport optics. As 
the latter is approximately Z R ~ ^N U X U , diffraction will likely just become appreciable over the interaction distance 
N u \ u in the kicker. 

The direct effects of the wavefront curvature are probably safely ignored, and the intensity fall-off due to the expansion 
of the transverse waist can be accommodated, although somewhat higher amplifier powers than estimated in the ID 
theory might be needed. Of most concern is the Guoy phase shift. If the laser focus is placed near the center of the 
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wiggler to help minimize amplifier power requirements, the Guoy phase will vary rapidly from close to —5 to +^ 
inside the kicker, and despite best efforts to ensure careful relative timing, the energy kick may tend to average away. 
This needs to be studied carefully. If the focus must be moved off-center or, worse, perhaps placed outside the kicker, 
by a distance comparable to the Raleigh range, in order to avoid a the region of rapid phase advance near the focus, 
then power requirements for OSC will become even more severe. 



c. Near-Field Effects 



With quantum-mechanical fears allayed, an accurate quantitative calculation will require accurate knowledge of the 
classical field. The paraxial Gaussian form may not be sufficiently accurate. For example, the solcnoidal (i.e., 
functionally-transverse, or divergence-free) part of the classical electric field (and hence the expectation value of the 
hemi-classical "photon" field) radiated by a single beam particle in the pick-up can be expressed in the well-known 
Licnard-Wicchcrt form: 



Ej(x, t) = q 



yfiX^hfjf. 

J 

"jx{(ra j -^ i )xgj} 



(I), 



(187) 



where Rj = x — rj(t), Rj = \Rj\, hj = ^r, rj(t) is the particle trajectory, (3j(t) = j:Tj(t) = ^jfj(t) is the normalized 

particle velocity, /3j(t) = ^(3j(t) is the normalized acceleration, and t'-, defined implicitly by t = tj + ~, is the 
retarded time. 



The far-field radiation pattern is determined by the asymptotic form of the final term only, which is proportional 
to the particle acceleration, and falls of like O (Jj) in distance from the source. The form of the far-field brightness 
function (i.e., optical Wigner function) of synchrotron radiation from relativistic particles moving in bending magnets, 
for which ^ < 1, has been worked out[7] in the paraxial case for regimes in which a u <C IZZ70 or 1 -C a u -C 70 for 
essentially any N u > 1, or for essentially any a u <§; 70 if N u ~ 1 or N u ^> 1. 

In the presumed OSC regime, where N u ~ O(10), while a u > 1, an exact analytic form is lacking, and the velocity 
fields may not be negligible. Since the radiation is emitted over the length N U X U , we would naively expect that a 
Fraunhofer-type approximation to the angular spectrum can be deduced from the acceleration fields and will be valid 
if the radiation is observed at some suitably large distance R from the wiggler, such that R ^> Ao, R S> N U X U , and 

(N X ) 2 

R ^> \" ■ While the first condition is entirely trivial, the second is unlikely to hold, and the third will certainly 
not hold. In order to have time and room to amplify the radiation and feed it back onto the particle beam, the 
wiggler radiation emitted over 0(5 m) will be collected, collimated, and imaged into the amplifier at a distance only 
about O(lm) from the end of the pickup, only a few or maybe ten times A u downstream, where near-fields will still 
be present, and their effects are not entirely clear. Such considerations will be explored in subsequent research. 



XIII. DISCUSSION 



Despite fears that quantum uncertainties would prove devastating for optical stochastic cooling based on the amplifi- 
cation of very weak pickup signals, the technique of OSC can in principle work despite these quantum uncertainties. 
What, then, precisely, went wrong with the naive reasoning? 



A. Why "Naive" Quantum Treatments Fail 



Particles do not emit wiggler radiation described by photon number states or any statistical mixtures of of such states. 
They do not radiate into any states resembling a number state, but rather a Glauber coherent state, or a classical 
statistical mixture of such states, in which photon number is not sharply defined, but phase information is partially 
available and joint quantum mechanical uncertainties in number/phase or in the quadrature components are jointly 
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minimal. These states can have arbitrarily low expectation values for photon number or EM energy, but still be every 
bit as "real' and "present" on each pass as any number state. 

Particles do not radiate in a series of stochastic quantum jumps, only once every 0(a~ 1 ) passes through the pickup 
wiggler. The radiation from each particle is present on every pass, even though its average energy may correspond 
to much less than one photon, and is available to be amplified by a quantum mechanical interaction, provided no 
intervening measurement first projects the state by actually trying to count photons. Moreover the wiggler radiation 
possesses, on average, exactly the amplitude and phase it would possess classically, so the "coherent signal" information 
necessary for transit-time cooling is effectively always present, only corrupted by the equivalent of some additional 
additive (not multiplicative) noise needed to satisfy the uncertainty principle, noise that physically can be traced back 
to "vacuum fluctuations" and amplified spontaneous emission within the gain medium. 

A linear amplifier does not function by multiplying photon number, either coherently or by projective measurement, 
but rather acts unitarily on the full quantum mechanical state of the EM field, scaling the field amplitude and adding 
some extra noise. 

In the pickup, the particles do not radiate whole numbers of photons, and in the kicker the particles do not measure, 
absorb, or otherwise sense photon number, but rather respond directly to the EM fields. 

Nothing actually prepares, emits, multiplies, evolves towards, measures, or counts photon number states, so the 
"discrete" nature of photons is not terribly important. 

A careful analysis reveals that the naive quantum mechanical fears were misguided: the cooling rate which accounts 
for quantum mechanics essentially agrees with the classical rate calculation, except that the amplifier noise, which 
can in principle be made arbitrarily small classically, is constrained by the quantum mechanics to some non-zero lower 
bound. 

This lower bound reflects the fact that the total fluctuations of the amplified field include the amplified input noise, 
which just consists of minimum-uncertainty vacuum fluctuations for coherent states, as well as added amplifier noise 
consisting of an independent set of amplified vacuum fluctuations. Input noise is amplified along with the signal 
because of linearity: a linear amplifier cannot know what part of the input state will later be regarded as noise or as 
signal. The additional noise is added as a consequence of the uncertainty principle: a phase-insensitive linear amplifier 
amplifies both quadrature components of a field mode, which do not commute, so at least one extra mode must be 
involved. 

Therefore quantum mechanical noise does not prevent fast optical cooling, but it does limit how small the incoherent 
heating terms can be made. The resulting expressions and estimates for the cooling times agree with classical 
calculations, provided the amplifier noise is accounted for through an effective value for N n . 



B. Synchrotron Radiation Damping 

If the naive "quantum jump" model of discrete Poissonian photon emission failed so spectacularly for the case of OSC, 
why does it work so well for synchrotron radiation damping (SRD) of electrons in storage rings? In such a damping 
scheme, rclativistic electrons of normalized energy 7 orbiting in a ring of radius ro spontaneously emit radiation in a 
continuum of frequencies up to about u c w §7 3 ^, and confined to an angular cone of about O(^) with respect to the 
electron orbit. Each electron emits the equivalent of about N ph ~ ^7<^ photons per revolution, at an average energy 
(fku) ~ i5 ^ fiuj c . For example, assuming an O(GeV) electron beam, corresponding to electrons with 7 ~ O(10 4 ) 

circulating in rings with r ~ O(10 2 m), each electron radiates the equivalent of about N ph ~ (9(6 • 10 2 ) photons per 
revolution, at wavelengths down to about A c = ~ O(10~ 10 m), corresponding to O(keV) X-rays. 

On average, each electron loses momentum along its direction of motion (parallel to its betatron orbit) due to the 
recoil from this synchrotron emission. If they are then re-accclcrated longitudinally, along the reference beam-line, 
transverse emittance is this reduced. 

Ultimately cooling is limited by quantum mechanical "randomness" in the emission, which, consistent with fluctuation- 
dissipation relations, leads to an incoherent heating term in addition to the damping term, which limits the cooling 
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rate as well as the ultimate emittance that can be achieved in equilibrium. In a pioneering and influential study, M. 
Sands [8 j successfully modeled these effects by using what we have called the "quantum jump" model, assuming that 
electrons emit radiation in the form of a whole number of quanta in random Poissonian fashion. 

In hindsight, it is clear that the OSC and SRD occur in completely different regimes and rely on rather different 
physics. In the OSC, each particle emits on any pass the equivalent of very few photons of small momentum, so direct 
radiation reaction effects are completely negligible. In SDR, each electron emits the equivalent of many photons of 
high momentum, and recoil effects in the virtual Compton scattering processes are significant - in fact, the large 
cumulative effects of recoil over several turns are precisely what is of principle concern. 

In OSC, the radiated field is collected, coherently amplified, and fed back onto the particles that emitted it, to induce 
an energy kick far larger than any direct losses due to emission, so the quantum state of the radiation matters greatly, 
while in SDR there is no feedback other than the individual electron recoils. Once any emitted radiation propagates 
away, it becomes irrelevant as far as the individual particles are concerned, and in fact one could in principle "trace 
out" the field degrees-of-freedom to obtain a non-Hamiltonian, diffusive evolution equation for the particles only. 

In neither case do we actually expect the particles to radiate a whole number of quanta in discrete emission events, 
but in the case of SDR, whether the radiation is described by a number state or coherent state or some other state is 
largely irrelevant, as long as we get the low-order moments right. Many different microscopic stochastic models can 
lead to the same Fokker-Planck equation. In effect, the Sands technique gets away with modeling a diffusion equation 
by one particular choice of a random walk model for emission which does not faithfully describe the actual emission 
process or the resulting radiation phase space, but mimics the proper statistics for the electron phase space, which is 
of primary interest. 

If simple scaling arguments or back-of-the-envelope calculations predict that the discrete nature of the energy eigen- 
states of the EM fields might be important, then rather than justifying the quantum jump model, this only gives more 
reason to adopt a fully quantum treatment of the radiation, at least if radiation properties or statistics are themselves 
needed, because quantum mechanics evolves the radiation modes or any system unitarily, not by a series of quantum 
jumps. If the radiation itself is never examined, but only the accumulated effects of its back-action on the particles 
during emission or absorption, then sometimes the Poissonian model will be useful, as with the Sands treatment of 
synchrotron radiation damping, or with the more sophisticated Poissonian emission model used by Friedland |58j to 
tease out the small-signal gain in FELs. 



C. Is the Field Evolution Really Unitary Throughout? 



The metaphor of corpuscular photons emitted in quantum jumps may be alluring, but can be deeply misleading in 
the analysis of stochastic cooling as well as certain other applications of wiggler radiation. Benson and MadevfT!?] 
assert: 

. . . the wiggler is capable of reducing the wavefunction [of a charged particle] by causing it to emit a 
photon. The electrons position can therefore be deduced from subsequent measurements. 

Reluctant as we are to disagree with pioneers of FEL physics, we are of the opinion that, as a reasonably accurate 
description of our original, naive opinions concerning quantum mechanical effects of the wiggler radiation, this state- 
ment is completely incorrect in the current context. We contend that the wiggler magnet itself does not in any way 
collapse or reduce any particles wavefunction — only a subsequent photon-counting or similar optical measurement 
will do that, which never happens in the context of OSC. Until a reversible macroscopic measurement of particles or 
radiation is made, the EM fields and beam particles evolve unitarily. Besides, with many electrons in a sample length, 
observation of a photon can tell us very little about the position or momentum of any one electron, so "subsequent 
measurement" only partially project a given particle's wavefunction. Nor does the wiggler cause the particle to emit 
a photon, if by photon we mean either a number state of the electromagnetic field, or even just a state of definite 
energy and/or momentum. In most practical regimes of interest, we have seen that the particles in a wiggler will 
generally emit radiation described by statistical mixtures of Glauber coherent states, which behave far differently 
(more classically) than number states. If recoil or other reaction effects become important, than the full quantum 
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state of the radiation and particles may become highly entangled, or may exhibit other features associated with 
non-positive Glauber-Sudarshan quasi-distributions that cannot be reproduced classically, but there is no reason to 
think the states of the field will be close to eigenstates of either the number or energy operators or would be an even 
approximately diagonal mixtures of these states. 

If photon-counting experiments are chosen to be performed, then whole numbers of photons will of course be measured, 
but this has little to do with what state describing the radiation before measurement. Of course the calculations of 
the field dynamics and field statistics can be performed in number state basis, or any other basis we choose, but 
the density matrices will not generally be sparse or nearly-diagonal in any sense, so the P-function representation is 
particularly natural and convenient. 

Other fields learned these lessons earlier. In [50 , Gordon, Louisell, and Walker conclude: 

We find that a classical description of the input fields and of the amplification process is completely valid 
provided we take correctly into account the response of the amplifier to the input zero-point fields. This 
result is valid for inputs of arbitrarily small power. 

This is basically correct, apart perhaps from a possible factor of two clarified by Caves [45], arising, as we have seen, 
from the extra set of amplified vacuum fluctuations which must necessarily be added to the field for amplifiers with 
gain on non-commuting field obscrvablcs, in addition to the original amplified vacuum fluctuations, which must be 
amplified along with the signal, because the amplifier cannot know what part of the input field is to be regarded as 
signal and what part is considered noise. 

If the wiggler magnet is not collapsing the wave function of an electron and causing it to emit a photon, perhaps the 
amplifier is? For large gain, whether the action of the amplifier is construed to be a measurement or not depends 
on your favorite interpretation of the quantum mechanics. Some might argue that it can be regarded as an effective 
measurement because it amplifies the field to levels that "we can lay our classical hands on," i.e., that can be 
classically observed, measured, and manipulated without significant further disturbance, measurement back-action or 
degradation of the SNR. Others might object, pointing out that no agent makes an irreversible, macroscopic record 
of any information in the radiation fields, at least not until after they are re-applied to the particles. Either way, it 
makes little difference to our argument. Even if you construe the amplifier to be performing a measurement of the 
radiation, it is doing so in a coherent state basis, not a Fock state basis. 

What about the environment at large? Even in a state-of-the art beam-line, with high vacuum, vibration control, etc., 
uncontrollable interactions with the numerous degrees-of-freedom in the environment will inevitably occur, leaving an 
imprint of information in various quantum entanglements. Without observational access to all of these environmental 
DOFs, the reduced density matrix for the system alone (particles and fields) will appear impure, and subsequently 
the system can behave as if the environment has effectively performed a measurement but just not told us the answer. 

This is the basic argument behind environmentally-induced decoherence|14[ 1151 H6[ 159] . which we have already in- 
voked in arguing for the classical behavior in the electron beam itself. While quantum-mechanical decohcrcncc in 
the radiation fields will undoubtedly proceed to some degree, it actually only helps our argument. For simplified but 
sensible models of "the environment" , it is again the coherent states, not the Fock states, that survive the "predictabil- 
ity sieve" and emerge as the preferred "pointer-basis" into which the decohered radiation states will appear to be 
projecteddS] GEH ED IS2]- This should not be too surprising. Decoherence seems to favor classical-looking states, and 
has in fact been suggested as a reason for emergence of a classical world. Conversely, a certain robustness or stability 
in the face of inevitable sources of decoherence is a key feature of classical-like states. Coherent states, or ignorance 
mixtures thereof, are according to the pioneering work of Glauber and Sudarshan and countless investigators since 
then, the closest thing to classical radiation fields allowed by quantum mechanics. 

Conversely, If we want to see discrete photons, we have to perform photon-counting experiments, which are usually 
(but not always) destructive, and usually do not leave the remaining field in a number state. If for some reason 
we really want the "amplifier" to multiply photon number states, rather than scale coherent states, then we must 
use photon-number-amplifiers (PNA)[57 rather than linear amplifiers, devices which have been proposed but which 
would be difficult in practice to operate with high fidelity. (Of course, using either unitary PNA or non-unitary 
photo-detection would be disastrous for OSC, as our naive model predicted). Fock states or other non-classical states 
with negative Glauber-Sudarshan quasi-distribution functions are just rather delicate and temperamental creatures, 
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difficult to create and maintain, and fragile to the touch of the environment. The bane of many investigations in 
atomic physics or quantum optics, this fragility of quantum coherence only works to our advantage here. 



D. Can the Quantum Mechanical Noise Limits be Achieved? 



We have found that quantum mechanics imposes a firm lower bound on the amount of noise that must be added by 
any linear amplifier in order to maintain consistency with the uncertainty principle and with unitarity, but have not 
addressed how difficult it may be to approach this ideal noise floor. 

In practice, there are also equilibrium and non-equilibrium thermodynamic limitations. The gain medium, as well as 
the various other optical elements arc coupled to thermal baths, so the "initial" state of the radiation field prior to 
the arrival of the pickup signal is typically not a vacuum, but a thermal blackbody distribution, with an average of 

W - (188) 

e k B T _ 1 

photons in each mode I of frequency u>g. At high temperatures, i.e., k B T 3> hujk, the average occupation (Ne) — > ^j- 
1, and the amplifier will have an effective noise at input far higher than the zero-point vacuum energy. Conversely, 

for low temperatures, where k B T <C hui£, the average photon number becomes (Ni) — > e fe sT <c ±. For A ~ 0(1 /mo), 
or equivalently hujg ~ 0(1 eV), and any temperature T < O(10 4 K), the thermal occupation is negligible, and the 
blackbody state will be virtually indistinguishable from vacuum as input. 

Along with large gain bandwidths, this is another tremendous advantage optical- frequency stochastic cooling might 
enjoy over traditional schemes. Merely from purely thermal considerations, it would be extremely difficult to construct 
and operate amplifiers anywhere close to the quantum limit at radio or microwave frequencies, but this is possible 
at optical frequencies, thanks to the exponential dependence of the Boltzmann factor. However, with very strong 
pumping, some active or passive cooling may be needed to ensure quiet and efficient operation. 

However, while the surrounding radiation field may start out in thermodynamic equilibrium, the amplifier itself is 
not, but instead subject to various energetic flows from the pump as well as the input and output signals. In fact 
a non-equilibrium state is of course necessary in order to achieve gain. A finite pump strength leads to a finite 
population inversion which in turn leads not only to smaller gain, but to "excess" spontaneous emission beyond that 
needed to satisfy the uncertainty principle. 

Using semi-classical arguments, it can be shown that the gain for narrow-band propagation along z in a typical 
inverted medium can be written 

G = e r *»l, (189) 

where 

r -^. (190) 

is the gain profile of the medium, and where Au> is the bandwidth, n ir > 1 is the index of refraction of the gain 
medium, A is the wavelength for the resonant lasing transition between levels 2 and 1, 7723 < 1 is the quantum 
efficiency for pumping, t is the characteristic fluorescence lifetime, or time-scale for spontaneous emission, ni is the 
average occupation density of the upper level 2, and 

P21 = > 1 (191) 

z 92 1 

is the population inversion parameter, in which ri\ is the average occupation density of the lower level 1 and g\ and 
gi are degeneracy factors for these levels. 

The effective number of noise photons then becomes 

Ko i se = 5 + (f , 2i-^)|l-^|, (192) 
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which approaches the ideal lower bound as G — > oo and P21 1 + , the latter implying total population inversion. 

A back-of-the-envelope calculation indicates that required pump power in a multi-stage Ti:sapphire solid state laser 
amplifier suitable for OSC might correspond to G ~ O (10 4 ) ^> 1 and P21 ~ O ( which corresponds to a noise power 
only 0(25%) higher than the quantum limit. In order to incorporate higher-than-ideal noise levels, we can just boost 
the effective noise temperature in the displaced thermal state describing the wiggler radiation after amplification. 
Again, the effects will be similar to those obtained by adding more particles to the sample. 

Quantum mechanics neither requires nor prevents an actual amplifier from exhibiting some multiplicative noise in 
addition to the required additive noise, for example due to pump fluctuations, but it will not be of the drastic all- 
or-nothing character that proved so deleterious to cooling in the Poissonian emission model. Since the thermal or 
chaotic states are already the sate of maximum entropy for given moments, we expect that they can still be used in 
the case of multiplicative noise in the gain, just with the second-order moments suitably increased to account for the 
extra noise. 



E. Comparison to the Heifets-Zolotorev Model 



Independently, Heifets and Zolotorev|63j developed a fully quantum ID model of transit-time optical stochastic 
cooling 86J . They follow the combined von Neumann density matrix for the particle bunch and the radiation through 
the various components of the cooling system, including the pickup wiggler (still with classical fields), the bypass for 
the particles and the amplifier for the radiation, and then the classical fields of the kicker wiggler. While both particles 
and fields are treated quantum mechanically, a simplified one-dimensional, single-mode (with fixed wavenumber k z ) 
model of the interaction inside the wiggler is used to calculate the initial emission by the particles and the final energy 
kick, and a simple model of the optical amplifier is employed, based on a completely-inverted population of two-level 
atoms. 

To solve analytically for the evolution, a number of expansions are made in various small parameters, particularly 
^0 ^ ' "^ ph ^ ^ ^ e mean num ber of coherent photons emitted per particle in the pickup) , -^J , and ■ 

After a minor tour de force in the manipulation of special functions, they obtain results which appear to be very 
similar to those of our hemi-classical theory: namely, assuming the coherent field mode is initially in vacuum, the 
pickup radiation from a single particle will be described by a Glauber coherent state, with average photon number 
N ph ~ a. The reduced density matrix of the amplified radiation is that of a chaotic, or thermal-like state, with the 
equivalent of one photon of amplifier noise at input (half for the original "zero-point motion," and half again from 
an independent set of atomic fluctuations, because the amplifier provides gain for both non-commuting quadrature 
components), although their combined particle-field density matrix contains the small but vital correlations (what 
classically we called the coherent component of the signal) needed to cool the particles. The cooling rates are close 
to the classical results except with the equivalent of 0(a~ 1 ) extra particles in the sample. 

Even for a simplified dynamical model, the fully quantum results are impressive, but only seem to confirm what 
we have established, namely that a hemi-classical formalism (which in the end really just amounts to a classical but 
stochastic model) is sufficient, and in principle will be more easily generalized to account for finite bandwidth, off-axis, 
or other neglected effects (which, however, are beyond the scope of our current investigations). 



XIV. CONCLUSIONS: SUMMARY AND FUTURE DIRECTIONS 



Assuming optimal gain, the rate of any stochastic cooling scheme can be improved by increasing the bandwidth 
of the pickup- amplifier-kicker system or by decreasing the number of neighboring particles (which actually cause 
heating) in a sample length, which varies inversely with the bandwidth. Large bandwidths available in solid-state or 
certain parametric amplifiers at optical wavelengths are very attractive for stochastic cooling, if various technological 
challenges can be met. In the transit-time OSC scheme, spontaneous undulator radiation from the pickup wiggler is 
amplified while particles are given appropriate phase delays in bypass, then amplified radiation acts back on particles 
within the kicker wiggler to reduce momentum spread and/or betatron amplitude. 
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The pickup sclf-radiation of each particle (responsible for cooling) is very small — on average, only 0(a) photons per 
particle in coherent mode, suggesting quantum effects may be important In a naive picture, akin to Sands treatment 
of synchrotron radiation damping, where particles emit whole photons in discrete Poissonian quantum jumps, it would 
seem particles rarely experience cooling self-radiation but almost always feel large heating cross-radiation, leading to 
drastically slower cooling or over-correction when the cooling photon finally appears. 

A more careful analysis has indicates the feared quantum catastrophe is a red herring: particles radiate into Glauber 
coherent states, or statistical mixtures thereof, and amplifiers amplify coherent states, not Fock (photon number) 
states. The cooling signal is really present on every pass. Particles emit radiation in a quantum mechanical state 
in which photon number and field energy are not sharply defined, but in which the expectation values of amplitude 
and phase are given by the classical values, rather than in some sort of quasi-classical Poisson process where whole 
photons are emitted in a series of discrete "quantum jumps." A low-noise amplifier will operate quantum mechanically, 
transforming the quantum state of the fields according to Hamiltonian dynamics, and does not act to first projectively 
measure then multiply photon number. After amplification, the amplified radiation in the kicker is essentially classical, 
and particles respond linearly to fields rather than via discrete photon absorption. The amplified phase signal from 
each particle is present on each pass, albeit partially obscured by some amount of noise due to amplified vacuum 
fluctuations of the original radiation field and certain internal amplifier modes. 

Therefore, quantum mechanical effects do not destroy cooling, but just introduce additional (additive) noise, approxi- 
mately equivalent (prior to amplification) to about 0(a~ 1 ) additional particles in a sample, setting an ultimate upper 
limit on cooling rates and on the gains in efficiency that can be expected from further beam dilution. 

If sufficient care is given to the quantum mechanical aspects of the radiation and amplification, which in the end just 
amounts to adding at least the minimum amount of amplifier noise that can physically be traced back to spontaneous 
emission in the gain medium, then optical stochastic cooling rates can be calculated using an essentially classical 
treatment with extra noise terms. For ultra-fast OSC, where the correction consists of a relatively small number 
of relatively-large kicks compared to traditional RF-based schemes, temporal smoothing or averaging is not really 
justified, but while individual particle behavior is subject to high uncertainty, properties the beam as a whole (mean 
per-particle energy and energy spread) can be reliably estimated because of averaging over many particles in the 
beam, rather than averaging over many time steps. 

Using the cooling rates as derived above for the simplified model of a cooling section, a very preliminary calculation 
indicates that micro-second cooling for muons is possible under somewhat optimistic expectations for the precision 
of the beam optics and an optimistic extrapolation for the power available in low-noise, high-bandwidth amplifiers. 
Furthermore, the bunch charge Nb ~ O(10 10 ) assumed in this study is smaller by an order of magnitude or two than 
what is discussed in current proposals for a muon collider, although because of the additional stochastic cooling, the 
final luminosity can be just as high. 

Now that concerns about potentially devastating quantum mechanical limitations have been assuaged, the possibility 
of ultra-fast cooling rates appears sufficiently promising to justify more careful investigation. Attention can turn to of 
the more prosaic classical effects that might limit performance, in order to include important physics so far neglected. 

A more realistic model of the classical wiggler radiation should be used, with the effects of off-axis particle injection and 
diffraction included. A careful classical calculation will be somewhat complicated by the unusual parameter regime 
of OSC wigglcrs, involving high magnetic field strengths and a relatively small number of relatively long-wavelength 
poles. The effects on particle orbits of angular deviations and oscillating components of the longitudinal motion in a 
planar wiggler arc not necessarily small, but have so far been neglected. Most critically, sensitivity to the errors in 
the bypass lattice optics must be investigated, the effects of only partial mixing between passes or unwanted mixing 
within passes addressed, the Guoy phase included, and the effects of non-uniform gain and optical dispersion in the 
amplifier realistically treated. 

Many technological challenges need to be addressed and assessed: the beam stretching needed to reversibly increase 
bunch length and decrease energy spread to manageable levels before cooling must be accomplished on microsecond 
time-scales; as must the re-compression after cooling, which is even more difficult since there will by design be 
less energy spread to be exploited in the high-dispersion lattice. The bypass and kicker optics must be carefully 
engineered, sufficiently adjustable to calibrate particle orbits within a fraction of a micron tolerance, and presumably 
be dynamically stabilized and controlled through active feedback. The optical amplifiers must be robust, stable, 
single-pass (non-regencrative) , highly linear even at high power, variable gain but with high peak gain, low-noise, and 
probably actively-cooled. Mixing (effective shifting longitudinal particle positions on the scale a radiation wavelength 
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or more) must be very thorough between passes the cooling sections, but negligible between the pickup and cooler of 
any given cooling section. 

Now, at least, it is confirmed that such cooling appears in principle completely consistent with quantum mechanical 
features of radiation and amplification. It remains to be seen if it is practical in the face of these more mundane but 
important constraints. 
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[65] This is what we will shortly recognize as the coherence length of the radiation, not the much longer wiggler length 
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[66] It might also be possible to work at an harmonic of this fundamental, but this case will not be considered here. 

[67] Coherent here refers to radiation that is is approximately diffraction-limited transversely and approximately Fourier- 
limited longitudinally or temporally. The radiation from a single particle into this bandwidth and in this transverse mode 
is coherent in this sense, but the radiation from the bunch as a whole is incoherent in general, consisting of approximately 
■A longitudinal modes, essentially randomly-phased with respect to each other, where N u Xo is the so-called coherence 
length, proportional to the inverse bandwidth. 

[68] Actually, a particle trailing another particle by any distance will see almost none of the other particle's radiation, since the 
radiation is largely confined to a small forward angle by relativistic effects, and the faster radiation slips forward relative 
to the particles, while a particle leading another particle by about N u Xo will interact with only a small fraction of that 
particle's radiation that slips sufficiently far ahead over the finite length of the kicker wiggler, so the effective sample length 
L 3 will be somewhat shorter than N u Ao, as will be seen below. 

[69] Note that the independent Poissonian photon emission model, in which all emissions are assumed uncorrelated, can predict 
any average emitted photon number, while the variance in photon number is always proportional to the mean. At least for 
the amplifier noise, we would imagine something closer to a so-called chaotic or thermal state, where due to interference 
effects the statistics are not those of shot noise, but rather the standard deviation is proportional to the mean. This is the 
first hint that something might be wrong with the reasoning advanced here.... 

[70] At this heuristic level, fortunately we can simply ignore the well-known technical difficulties associated with defining a 
self-adjoint phase operator in quantum mechanics conjugate to the usual bosonic number operator. 

[71] While the energy of a spin state can be made arbitrarily large by increasing the magnetic field, the action associated with 
any spin states of a single lepton is bounded in magnitude by and so cannot participate in any sort of Correspondence 
Principle limit. See the discussion in the sequel. 

[72] Incidentally, this makes the prospects for using optical stochastic cooling to perform beam polarization look rather bleak. 
Actually, partially spin-polarized particle beams have been achieved, and would seem to offer a counter-example to the 
Bohr-Mott Conjecture, except that only collective spin observables on many particles are ever really measured, which can 
behave classically. 

[73] Curiously, there is an apparently consistent formalism for evolving matter quantum mechanically but radiation classically 
which includes radiation reaction effects, the so-called neoclassical radiation theory of E. T. Jaynes, which can capture 
many effects, such as the photoelectric effect and the Lamb shift, that are purported to be evidence for the quantum 
nature of light. However, some predictions of quantum optics, such as perfect anti-correlation after a beam-splitter for 
single photon states, or photon polarization states that violate Bell's inequalities, cannot be described in this model. 
Anyway, here we are concerned with the opposite "hemi-classical" limit, with particles treated classically and radiation 
quantum mechanically. 

[74] We will also see that within the hemi-classical formalism, the behavior and observable properties of the radiation will 

actually be almost entirely classical, so the hemi-classical theory is semi-classical in the first sense of the term, but 

complementary to the second, widely-used meaning. 
[75] As we have seen, in some regimes, collective Langmuir oscillations might be marginally appreciable, but as we are interested 

in a fundamental proofs-of-principle regarding the effects of radiation noise, these collective effects will be subsequently 

ignored. 

[76] It seems unlikely than any fully general, Hamiltonian theory of macroscopic quantum optics can be established, because 
the energy cannot always be written as a well-defined function of the macroscopic fields and free sources at one instant of 
time. 

[77] Disagreement continues over whether the electric field E(x,t) or the displacement field D(x,t) should be canonically 
conjugate to the vector potential A(x, t). Also, in the microscopic theory it is well known that with the addition of sources, 
the commutation relations of the field observables are unchanged, and commute with all observables for the material 
degrees-of-freedom, which does not always seem to be the case with the quantized macroscopic theories. These questions 
are not fully resolved, but [64] suggests that some of the disagreement may just be due to different gauge choices, or to 
different dressed operators going by the same name. Anyway, this mostly concerns the meaning of "photons" and the 
nature of measurements performed inside the dielectric medium, which are of minor concern here. 

[78] We have worked out in detail the case where the quantization region is simply-connected and simply-bounded. Some 
subtleties arise if it is not simply-connected, because then the kernel of the curl operator can contain certain vector fields 
which are not gradients of a single- valued scalar field, reflecting the non-trivial cohomology of the interior region. However, 
generalizations of the Helmholtz-Hodge Decomposition Theorem can be used in these cases, and this is a subject of ongoing 
study on our part. 

[79] We had worked out this particular quantum theory of dielectrics in some detail, before realizing it was nearly identical to 

that developed by Glauber and Lewenstein in |f 8| and later by other authors. 
[80] Following both Caves[45 , and general conventions for paraxial optics, we labeled the modes by the frequency ui at which 

they would oscillate if allowed to freely evolve, because in general they are not characterized by a true fixed wavenumber. 

The explicit dependence on oj is simply a label, and is not meant to indicate that these are in general Fourier transforms 

of time-domain quantities. In fact, we remain in the time-domain, where functions labeled by ui nevertheless evolve in t, 

non-trivially in the presence of free sources. 
[81] Necessity is now widely recognized. Sufficiency is sometimes still debated for very weak or occasionally other states. But 

any discreteness or other quantum signatures in the measurements of such a dim radiation state can always be attributed 

to quantum mechanical features of the detectors rather than the fields, so we stand by sufficiency as well. 
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[82] The results here can be simply generalized in the obvious manner to multiple discrete modes, or even to continuum modes 
where the expressions will involve quasi-distribution functionals, but the notation involves functional derivatives and path 
integrals and gets rather cumbersome. 

[83] Viz a viz the radiation of concern to us, each particle radiates into a single coherent wiggler mode consisting approximately 
of a diffraction- limited and Fourier-limited wave packet of length N u \ u , but the eigenmodes of the optical system were 
taken to be time-harmonic, so many of these modes are required to represent the coherent wiggler mode. However, only part 
of the zero-point noise from each of these harmonic eigenmodes appears over the actual extent of the radiated wave-packet 
felt by any given electron, so we can still count roughly one mode in the "half photon per mode" noise rule. 

[84] We will follow standard convention and refer to the quantum mechanical uncertainties in the vacuum state or coherent states 
qua displaced vacuum states as "vacuum fluctuations" or "zero-point motion," even though the mathematical formalism 
clearly identifies them as predictive uncertainties, not physical fluctuations. 

[85] As a corollary, we deduce that if somehow only one quadrature component is amplified, then no additional half photon of 
noise at input need be added, beyond that represented in the original "vacuum fluctuations." 

[86] M. Zolotorev should be credited with instigating both approaches to the problem. After he suggested this investigation 
to us, and our answers were not forthcoming with sufficient alacrity, he went in search of a theoretical mercenary with 
bigger mathematical weapons at his disposal. Our simpler treatment based on coherent states did uncover a mistake in 
preliminary calculations by Heifets and Zolotorev. 



